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(57) Abstract 
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and encoded polypeptides are provided. The nucleic acids and polypeptides disclosed herein are useful as therapeutic and diagnostic agents. 
Agents that selectively bind to the foregoing polypeptides and genes also are provided. 
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CHARACTERIZATION OF A CALCIUM CHANNEL FAMILY 

Field of the Invention 

This invention relates to nucleic acids coding for a novel family of calcium channel 
polypeptides, the encoded polypeptides, unique fragments of the foregoing, and methods of 
5 making and using same. 

Background of the Invention 

Calcium channels are membrane-spanning, multi-subunit proteins that facilitate the 
controlled transport ("flux") of Ca^^ ions into and out of cells. Cells throughout the animeil 
kingdom, and at least some bacterial, fungal and plant cells, possess one or more types of 

io calcium chaimels. In general, "excitable" cells, such as neurons of the central nervous 
system, peripheral nerve cells, and muscle cells, including those of skeletal muscles, cardiac 
muscles, and venous and arterial smooth muscles, possess voltage-dependent calcium 
channels. In a voltage-dependent calcium channel, the transport of Ca^^ ions into and out of 
the cells requires a certain minimal level of depolarization (the difference in potential between 

15 the inside of the cell bearing the channel and the extracellular environment) with the rate of 
Ca^*^ cell flux dependent on the difference in potential. In "non-excitable" cells, calcium 
influx is thought to occur predominantly in response to stimuli which cause the release of 
, calcium from intracellular stores. This process, termed store operated calcium influx, is not 
well understood. 

20 Characterization of a particular type of calcium channel by analysis of whole cells is 

complicated by the presence of mixed populations of different types of calcium channels in 
the majority of cells. Although single-channel recording methods can be used to examine 
individual calcium channels, such analysis does not reveal information related to the 
molecular structure or biochemical composition of the channel. Furthermore, in this type of 

25 analysis, the chaimel is isolated from other cellular constituents that might be important for 
the channers natural functions and pharmacological interactions. To study the calcium 
channel structure-function relationship, large amounts of pure channel protein are needed. 
However, acquiring large amounts of pure protein is difficult in view of the complex nature of 
these multisubunit proteins, the varying concentrations of calcium channel proteins in tissue 

so sources, the presence of mixed populations of calcium channel proteins in tissues, and the 
modifications of the native protein that can occur during the isolation procedure. 



wo 00/40614 PCT/US99y29996 

Summary of the Invention 

The invention is based on the identification of a novel family of calcium channel 
polypeptides and the molecular cloning and partial characterization of a novel member of this 
family that is expressed predominantly in human hematopoietic cells, liver, and kidney. This 
newly identified family of calcium channel polypeptides is designated, "SOC" or "CRAC" or 
"ICRAC", for Store Operated Channels or Calcium Release Activated Channels. Although 
not wishing to be bound to any particular theory or mechanism, it is believed that the 
SOC/CRAC calcium channel polypeptides are transmembrane polypeptides that modulate 
Ca^^ flux "into" and "out of a cell, for example, in certain instances they may be activated 
upon depletion of Ca^^ from intracellular calcium stores, allowing Ca^"^ influx into the cell. 
Accordingly, the compositions disclosed herein are believed to be useful for modulating 
calcium transport into and out of such intracellular stores and for the treatment of disorders 
that are characterized by aberrant calcium transport into and out of such intracellular stores. 
In particular, we believe that the SOC/CRAC calcium channel polypeptides disclosed herein 
play an important role in the influx of extracellular calcium by mediating the refilling of 
intracellular calcium stores following their depletion. Accordingly, we believe that the 
compositions for expressing functional SOC/CRAC calcium chaxmel polypeptides in cells, as 
disclosed herein, are useful for treating patients having conditions that are characterized by 
reduced extracellular calcium influx into their SOC/CRAC-expressing cells. Additionally, the 
compositions of the invention are useful for delivering therapeutic and/or imaging agents to 
cells which preferentially express SOC/CRAC calcium channel polypeptides and, in 
particular, for delivering such agents to hematopoietic cells, liver, heart, spleen, and kidney to 
modulate proliferation and growth of these cells. Moreover, in view of the importance of 
cellular calcium levels to cell viability, we believe that SOC-2/CRAC-1, SOC-3/CRAC-2, and 
SOC-4/CRAC-3 as disclosed herein, and/or other members of the SOC/CRAC family of 
calcium channel polypeptides, represent an ideal target for designing and/or identifying (e.g., 
from molecular libraries) small molecule inhibitors that block lymphocyte proliferation, as 
well as other binding agents that selectively bind to SOC/CRAC polypeptides to which drugs 
or toxins can be conjugated for delivery to SOC/CRAC polypeptide expressing cells. 

The invention is based, in part, on the molecular cloning and sequence analysis of the 
novel SOC/CRAC calcium channel molecules disclosed herein (also referred to as a "SOC- 
2/CRAC-l molecule," a "SOC-3/CRAC.2 molecule," and/or "SOC-4/CRAC-3 molecule") 
that are predominantly expressed in human hematopoietic cells, liver, spleen, heart, and 
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kidney (SOC-2/CRAC-1), kidney and colon (SOC-3/CRAC-2), and prostate (SOC-4/CRAC-3 
molecule). As used herein, a "SOC/CRAC molecule" embraces a "SOC/CRAC calcium 
channel nucleic acid" (or "SOC/CRAC nucleic acid") and a "SOC/CRAC calcium channel 
polypeptide" (or "SOC/CRAC polypeptide"). Homologs and alleles also are embraced within 
the meaning of a SOC/CRAC calcium channel molecule. 

According to one aspect of the invention, isolated SOC/CRAC nucleic acids which 
code for one or more member(s) of the SOC/CRAC family of calcium channel polypeptides 
or unique fragments thereof eire provided. The isolated nucleic acids refer to one or more of 
the following: 

(a) nucleic acid molecules which hybridize under stringent conditions to a nucleic acid 
molecule selected from the group consisting of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, 
SEQ ID NO:7, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, and SEQ ID 
NO:31, and vsdiich code for a SOC/CRAC polypeptide; 

(b) deletions, additions and substitutions of (a) vsiiich code for a respective 
SOC/CRAC polypeptide; 

(c) nucleic acid molecules that differ firom the nucleic acid molecules of (a) or (b) in 
codon sequence due to the degeneracy of the genetic code, and 

(d) complements of (a), (b) or (c). 

The invention in another aspect provides an isolated nucleic acid molecule selected 
from the group consisting of (a) a unique fragment of a nucleic acid molecule selected from 
the group consisting of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID 
NO:23, SEQ ID NO:25, SEQ ID NO:29, and SEQ ID NO:31, (b) complements of (a), 
provided that the unique fragment includes a sequence of contiguous nucleotides which is not 
identical to any sequence selected from a sequence group consisting of (1) sequences having 
the SEQ. ID NOS. or GenBank accession numbers of Table I, (2) complements of (1), and (3) 
fragments of (1) and (2). 

According to yet another aspect of the invention, isolated SOC/CRAC polypeptides 
are provided. The isolated SOC/CRAC polypeptide molecules are encoded by one or more 
SOC/CRAC nucleic acid molecules of the invention. Preferably, the SOC/CRAC polypeptide 
contains one or more polypeptides selected from the group consisting of the polypeptides 
having SEQ. ID Nos. 2, 4, 6, 8, 24, 26, 28, 30, and 32. In other embodiments, the isolated 
polypeptide may be a fragment or variant of the foregoing SOC/CRAC polypeptide molecules 
of sufficient length to represent a sequence unique within the hiunan genome, and identifying 
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with a polypeptide that functions as a calcium channel, provided that the fragment excludes a 
sequence of contiguous amino acids identified in Table II, and/or excludes a sequence of 
contiguous amino acids encoded for by a nucleic acid sequence identified in Table I. In 
another embodiment, immunogenic fi-agments of the polypeptide molecules described above 
are provided. 

According to another aspect of the invention, isolated SOC/CRAC binding agents 
(e.g., polypeptides) are provided which selectively bind to a SOC/CRAC molecule (e.g., a 
SOC/CRAC polypeptide encoded by the isolated nucleic acid molecules of the invention). 
Preferably, the isolated binding agents selectively bind to a polypeptide which comprises the 
sequence selected fi-om the group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, 
SEQ ID NO:8, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, and SEQ ID 
NO: 3 2, or unique fragments thereof. In the preferred embodiments, the isolated binding 
polypeptides include antibodies and fragments of antibodies (e.g.. Fab, F(ab)2, Fd and 
antibody figments which include a CDR3 region which binds selectively to a SOC/CRAC 
polypeptide). Preferably, the antibodies for human therapeutic applications are human 
antibodies. 

According to another aspect of the invention, a pharmaceutical composition containing 
a pharmaceutically effective amount of an isolated SOC/CRAC nucleic acid, an isolated 
SOC/CRAC polypeptide, or an isolated SOC/CRAC binding polypeptide in a 
pharmaceutically acceptable carrier also is provided. The pharmaceutical compositions are 
useful in accordance with therapeutic methods disclosed herein. 

According to yet another aspect of the invention, a method for isolating a SOC/CRAC 
molecule is provided. The method involves: 

a) contacting a SOC/CRAC nucleic acid or a SOC/CRAC binding polypeptide with a 
sample that is believed to contain one or more SOC/CRAC molecules, under conditions to 
form a complex of the SOC/CRAC nucleic acid or the SOC/CRAC binding polypeptide and 
the SOC/CRAC molecule; 

b) detecting the presence of the complex; 

c) isolating the SOC/CRAC molecule from the complex; and 

d) determining whether the isolated SOC/CRAC molecule has SOC/CRAC calcium 
channel activity. As used herein "SOC/CRAC calcium channel activity" refers to the 
transport of Ca^* into and out of intracellular stores that is mediated by a SOC/CRAC 
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poiypeptide. In general, the SOC/CRAC calcium channel activity is initiated by a reduction 
or depletion of intracellular calcium stores. 

In certain embodiments, the SOC/CRAC nucleic acid is a SOC-2/CRAC-1 nucleic 
acid (e.g., a nucleic acid having SEQ. ID NO. 27, or complements thereof); in certain other 
' embodiments, the SOC/CRAC nucleic acid is a SOC-3/CRAC-2 nucleic acid (e.g., a nucleic 
acid having SEQ. ID NO. 29, or complements thereof); in further embodiments, the 
SOC/CRAC nucleic acid is a SOC-4/CRAC-3 nucleic acid (e.g., a nucleic acid having SEQ. 
ID NO. 31, or complements thereof). In yet other embodiments, the SOC/CRAC polypeptide 
is a SOC-2/CRAC-1 binding polypeptide (e.g., an antibody that selectively binds to a SOC- 
2/CRAC-l polypeptide). In yet further embodiments, the SOC/CRAC polypeptide is a SOC- 
3/CRAC-2 binding polypeptide (e.g., an antibody that selectively binds to a SOC-3/CRAC-2 
polypeptide). In some embodiments, the SOC/CRAC polypeptide is a SOC-4/CRAC-3 
binding polypeptide (e.g., an antibody that selectively binds to a SOC-4/CRAC-3 
polypeptide). In the preferred embodiments, the isolated binding polypeptides include 
antibodies and fragments of antibodies (e.g.. Fab, F(ab)2, Fd and antibody fragments which 
include a CDR3 region which binds selectively to a SOC-2/CRAC-1, to a SOC-3/CRAC-2, 
and/or to a SOC-4/CRAC-3 polypeptide). Preferably the isolated binding polji^eptides or 
other binding agents selectively bind to a single SOC/CRAC molecule, i.e., are capable of 
distinguishing between different members of the SOC/CRAC family. Accordingly, one or 
more SOC/CRAC binding agents can be contained in a single composition (e.g., a 
pharmaceutical composition) to identify multiple SOC/CRAC molecules in vivo or in vitro. 

According to yet another aspect of the invention, a method for identifying agents 
useful in the modulation of SOC/CRAC calcium channel activity is provided. The method 
involves: 

a) contacting a SOC/CRAC polypeptide with a candidate agent suspected of 
modulating SOC/CRAC calcium channel activity, under conditions sufficient to allow the 
candidate agent to interact selectively with (e.g. bind to) the SOC/CRAC polypeptide; 

b) detecting a Ca^* concentration of step (b) associated with the SOC/CRAC calcium 
channel activity of the SOC/CRAC polypeptide in the presence of the candidate agent; and 

c) comparing the Ca^* concentration of step (b) with a control Ca^* concentration of a 
SOC/CRAC polypeptide in the absence of the candidate agent to determine whether the 
candidate agent modulates (increases or decreases) SOC/CRAC calcium channel activity. 
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According to another aspect of the invention, a method for identifying agents useful in 
the modulation of a SOC/CRAC polypeptide kinase activity is provided. The method 
involves: 

a) contacting a SOC/CRAC polypeptide with kinase activity with a candidate agent 
suspected of modulating SOC/CRAC kinase activity, under conditions sufficient to allow the 
candidate agent to interact with the SOC/CRAC polypeptide and modulate its kinase activity; 

b) detecting a kinase activity associated with the SOC/CRAC polypeptide in the 
presence of the candidate agent; and 

c) comparing the kinase activity of step (b) with a control kinase activity of a 
SOC/CRAC polypeptide in the absence of the candidate agent to determine whether the 
candidate agent modulates (increases or decreases) SOC/CRAC kinase activity. In some 
embodiments the SOC/CRAC polypeptide comprises amino acids 999-1180 of the SOC- 
2/CRAC-l polypeptide (SEQ ID NO:24), or a fragment thereof that retains the kinase activity. 

According to yet another aspect of the invention, a method for determining the level of 
expression of a SOC/CRAC polypeptide in a subject is provided. The method involves: 

a) measuring the expression of a SOC/CRAC polypeptide in a test sample, and 

b) comparing the measured expression of the SOC/CRAC polypeptide in the test 
sample to the expression of a SOC/CRAC polypeptide in a control containing a known level 
of expression to determine the level of SOC/CRAC expression in the subject. Expression is 
defined as SOC/CRAC mRNA expression or SOC/CRAC polypeptide expression. Various 
methods can be used to measure expression. The preferred embodiments of the invention 
utilize PCR and Northem blotting for measuring mRNA expression, and monoclonal or 
polyclonal SOC/CRAC antisera as reagents for measuring SOC/CRAC polypeptide 
expression. In preferred embodiments, the SOC/CRAC molecule (nucleic acid and/or 
polypeptide) is SOC-2/CRAC-1. In other preferred embodiments, the SOC/CRAC molecule 
is SOC-3/CRAC-2. In yet further preferred embodiments, the SOC/CRAC molecule is SOC- 
4/CRAC-3. In certain embodiments, the test samples include biopsy samples and biological 
fluids such as blood. The method is useful, e.g., for assessing the presence or absence or 
stage of a proliferative disorder in a subject. 

The invention also contemplates kits comprising a package including assays for 
SOC/CRAC epitopes, SOC/CRAC nucleic acids, and instructions, and optionally related 
materials such as controls, for example, a number, color chart, or an epitope of the expression 
product of the foregoing isolated nucleic acid molecules of the invention for comparing, for 
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example, the level of SOCVCRAC polypeptides or SOC/CRAC nucleic acid forms (wild-type 
or mutant) in a test sample to the level in a control sample having a known amount of a 
SOC/CRAC nucleic acid or SOC/CRAC polypeptide. This comparison can be used to assess 
in a subject a risk of developing a cancer or the progression of a cancer. The kits may also 
5 include assays for other known genes, and expression products thereof, associated with, for 
example, proliferative disorders (e.g., BRCA, p53, etc.). In a preferred embodiment, the kit 
comprises a package containing: (a) a binding agent that selectively binds to an isolated 
nucleic acid of the invention or an expression product thereof to obtain a measured test value, 
(b) a control containing a known amount of a SOC/CRAC nucleic acid or a SOC/CRAC 
w polypeptide to obtain a measured control value, and (c) instructions for comparing the 
measured test value to the measured control value to determine the amount of SOC/CRAC 
nucleic acid or expression product thereof in a sample. 

The invention provides isolated nucleic acid molecules, unique fragments thereof, 
expression vectors containing the foregoing, and host cells containing the foregoing. The 
IS invention also provides isolated binding polypeptides and binding agents which bind such 
polypeptides, including antibodies, and pharmaceutical compositions containing any of the 
compositions of the invention. The foregoing can be used, inter alia, in the diagnosis or 
treatment of conditions characterized by the aberrant expression levels and/or the presence of 
mutant forms of a SOCYCRAC nucleic acid or polypeptide. The invention also provides 
20 methods for identifying agents that alter the function of the SOC/CRAC polypeptide. 

These and other aspects of the invention, as well as various advantages and utilities, 
will be more apparent with reference to the detailed description of the preferred embodiments. 

Brief Description of the Sequences 
SEQ ID NO: 1 is a partial nucleotide sequence of the human SOC-2/CRAC-1 cDNA. 
25 SEQ ID NO:2 is the predicted amino acid sequence of the translation product of 

human SOC-2/CRAC-1 cDNA (SEQ ID NO:l). 

SEQ ID NO:3 is a partial nucleotide sequence of the human SOC-2/CRAC-1 cDNA. 
SEQ ID NO:4 is the predicted amino acid sequence of the translation product of 
human SOC-2/CRAC-1 cDNA (SEQ ID NO:3). 
BO SEQ ID NO:5 is a partial nucleotide sequence of the human SOC-2/CRAC-1 cDNA. 

SEQ ID NO:6 is the predicted amino acid sequence of the translation product of 
human SOC-2/CRAC-1 cDNA (SEQ ID NO:5). 
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SEQ ID NO:7 is a partial nucleotide sequence of the mouse homologue (mSOC- 
2/CRAC-l) of the human SOC-2/CRAC-1 cDNA. 

SEQ ID NO:8 is the predicted amino acid sequence of the translation product of the 
mSOC-2/CRAC-l cDNA (SEQ ID NO:7). 

SEQ ID NO:9 is the nucleotide sequence of the mouse MLSN-1 (SOC-1) cDNA. 

SEQ ID NO: 10 is the predicted amino acid sequence of the translation product of the 
mouse MLSN-1 (SOC-1) cDNA (SEQ ID NO:9). 

SEQ ID NO: II is the nucleotide sequence of a human calcium channel cDNA with 
GenBank Acc. no.: AB001535. 

SEQ ID NO: 12 is the predicted amino acid sequence of the translation product of the 
human calcium channel cDNA with GenBank Acc. no.: AB001535 (SEQ ID N0:1 1). 

SEQ ID NO: 13 is the amino acid sequence of a C. Elegans polypeptide at the 
c05cl2.3 locus. 

SEQ ID NO: 14 is the amino acid sequence of a C. Elegans polypeptide at the F54D1 

locus. 

SEQ ID NO:15 is the amino acid sequence of a C. Elegans polypeptide at the tOlHB 

locus. 

SEQ ID NO: 16 is the nucleotide sequence of a mouse kidney cDNA with GenBank 
Acc. no.: AI226731. 

SEQ ID NO: 17 is the predicted amino acid sequence of the translation product of the 
mouse kidney cDNA with GenBank Acc. no.: AI22673 1 (SEQ ID NO: 16). 

SEQ ID NO: 18 is the nucleotide sequence of a human brain cDNA with GenBank 
Acc. no.: H18835. 

SEQ ID NO: 19 is the predicted amino acid sequence of the translation product of the 
human brain cDNA with GenBank Acc, no.: H18835 (SEQ ID NO: 18). 

SEQ ID NO:20 is the nucleotide sequence of the human EST with GenBank Acc. no.: 
AA419592. 

SEQ ID NO:21 is the nucleotide sequence of the human EST with GenBank Acc. no.: 
AA4 19407. 

SEQ ID NO:22 is the nucleotide sequence of the mouse EST with GenBank Acc. no.: 
AI098310. 
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SEQ ID NO:23 is a partial nucleotide sequence of the hximan SOC-2/CRAC-1 cDNA 
that contains the SOC-2/CRAC-1 sequences of SEQ ID NO:l. SEQ ID NO:3, and SEQ ID 
NO:5. 

SEQ ID NO:24 is the predicted amino acid sequence of the translation product of 
5 human SOC-2/CRAC-1 cDNA (SEQ ID NO:23). 

SEQ ID NO:25 is a partial nucleotide sequence of the human SOC-3/CRAC-2 cDNA, 
SEQ ID NO:26 is the predicted amino acid sequence of the translation product of 
human SOC-3/CRAC-2 cDNA (SEQ ID NO:25). 

SEQ ID NO:27 is the full nucleotide sequence of the human SOC-2/CRAC-1 cDNA. 
W SEQ ID NO:28 is the predicted amino acid sequence of the translation product of 

human SOC-2/CRAC-1 cDNA (SEQ ID NO:27). 

SEQ ID NO:29 is the full nucleotide sequence of the human SOC-3/CRAC-2 cDNA. 
SEQ ID NO:30 is the predicted amino acid sequence of the translation product of 
human SOC-3/CRAC-2 cDNA (SEQ ID NO:29). 
15 SEQ ID NO:3 1 is the full nucleotide sequence oif the human SOC-4/CRAC-3 cDNA. 

SEQ ID NO:32 is the predicted amino acid sequence of the translation product of 
human SOC-4/CRAC-3 cDNA (SEQ ID NO:31). 

Brief Description of the Drawings 
Figure I is a schematic depicting the intron/exon organization of the chicken SOC- 
20 2/CRAC-l genomic sequence, as well as the putative transmembrane (TM) domains, and the 
targeting constructs utilized in the knockout experiments. 

Detailed Description of the Invention 
One aspect of the invention involves the partial cloning of cDNAs encoding members 
of a novel family of calcium channel polypeptides, referred to herein as "SOC/CRAC* 
25 (designated "SOC" or "CRAC" or "ICRAC", for Store Operated Channels or Calcium 
Release Activated Channels, or CECH). Although not intending to be bound to any particular 
mechanism or theory, we believe that a SOC/CRAC family member is a transmembrane 
calcium chatmel that modulates Ca^"^ flux "^into" and "'out of a cell; in certain instances it may 
be activated upon depletion of Ca^"^ from intracellular calcium stores, allowing Ca^"^ influx 
30 into the cell. 

The first three isolated SOC/CRAC members disclosed herein, define a new family of 
calcium channels which is distinct from previously described calcium channels, such as 
voltage gated calcium channels, ryanodine receptor/inositol-l,4,5-triphosphate receptor 
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channels, and Transient Receptor Potential (TRP) channels. The SOC/CRAC family of 
calcium channels exhibits high selectivity (with a Pca/PNa ratio near 1000), a unitary 
conductance below the detection level of the patch clamp method (the conductance estimated 
at approximately 0.2 picosiemens), and are subject to inhibition by high intracellular calcium 
levels. Although not intending to be bound to any particular mechanism or theory, we believe 
that SOC/CRAC calcium channels are responsible for the majority of, for example, calcium 
entry which occurs when intracellular calcium stores are depleted, and that SOC/CRAC 
currents are important for initiating various types of calcium-dependent processes. Thus, we 
believe that SOC/CRAC calcium channels play an important role in cellular calcium 
homeostasis by, e.g., modulating the supply of calcium to refill intracellular stores when 
depleted. 

The isolated full-length sequence of a representative, first member of the SOC/CRAC 
family, human SOC/CRAC nucleic acid (cDNA), SOC-2/CRAC-1, is represented as the 
nucleic acid of SEQ ID NO:27. This nucleic acid sequence codes for the SOC-2/CRAC-1 
polypeptide with the predicted amino acid sequence disclosed herein as SEQ ID NO:28. A 
homologous mouse cDNA sequence (>90% identity to the human at the nucleotide level) is 
represented as the nucleic acid of SEQ ID NO:7, and codes for a imique fragment of a mouse 
SOC-2/CRAC-1 polypeptide having the predicted, partial amino acid sequence represented as 
SEQ ID NO:8. Analysis of the SOC-2/CRAC-1 partial sequence by comparison to nucleic 
acid and protein databases show that SOC-2/CRAC-1 shares a limited homology to mouse 
MLSN-1 (SOC-1, SEQ ID NOs: 9 and 10). Limited homology is also shared between SOC- 
2/CRAC-l and three C Elegans polypeptides (SEQ ID NOs: 13, 14, and 15). We further 
believe that SOC-2/CRAC-1 plays a role in the regulation of cellular Ca^**^ fluxing and, in 
particular, lymphocyte Ca^"*^ fluxing. 

A second member of the human SOC/CRAC family of calcium channels, SOC- 
3/CRAC-2, is represented as the nucleic acid of SEQ ID NO:29, and codes for the human 
SOC-3/CRAC-2 polypeptide having the predicted amino acid sequence represented as SEQ 
ID NO:30 (this molecule may also be referred to as CECH2). SOC-3/CRAC-2 is 
predominantly expressed in human hematopoietic cells (including peripheral blood 
lymphocytes, liver, bone marrow, spleen, thymus, lymph nodes, heart, and kidney. 
Expression can also be detected (at lesser levels) in brain, skeletal muscle colon, small 
intestiae, placenta, lung, and cells (cell lines) such as HL-60, HeLa, K562, MOLT-4, S W-480, 
A459,andG361. 
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A third member of the human SOC/CRAC family of calcium chamieis, SOC- 
4/CRAC-3, is represented as the nucleic acid of SEQ ID NO:31, and codes for the human 
SOC-4/CRAC-3 polypeptide having the predicted amino acid sequence represented as SEQ 
ID NO:32 (this molecule may also be referred to as CECH6). It specifically expressed in the 
prostate gland/cells. 

As used herein, a SOC/CRAC calcium channel nucleic acid (also referred to herein as 
a "SOC/CRAC nucleic acid" refers to a nucleic acid molecule w^ich: (1) hybridizes under 
stringent conditions to one or more of the nucleic acids having the sequences of SEQ. ID 
NOS. 7, 27, 29, and/or 31 (sequences of the mouse and human SOC-2/CRAC-1, human SOC- 
3/CRAC-2, and human SOC-4/CRAC-3 nucleic acids), and (2) codes for a SOC-2/CRAC-1, a 
SOC-3/CRAC-2 or a SOC-4/CRAC-3 calcium channel polypeptide, respectively, or unique 
fragments of said SOC-2/CRAC-1, SOC-3/CRAC-2, or SOC-4/CRAC-3 polypeptide. 

As used herein, a SOC/CRAC calcium channel polypeptide (also referred to herein as 
a "SOC/CRAC polypeptide") refers to a polypeptide that is coded for by a SOC-2/CRAC-1 , a 
SOC-3/CRAC-2, and/or a SOC-4/CRAC-3 nucleic acid. Preferably, the above-identified 
SOC/CRAC polypeptides mediate transport of calcium into and out of a cell. 

SOC/CRAC polypeptides also are useful as immimogenic molecules for the 
generation of binding polypeptides (e.g., antibodies) which bind selectively to SOC/CRAC 
(e.g., SOC-2/CRAC-1, SOC-3/CRAC-2, and/or SOC-4/CRAC-3) polypeptides. Such 
antibodies can be used in diagnostic assays to identify and/or quantify the presence of a 
SOC/CRAC polypeptide in a sample, such as a biological fluid or biopsy sample. 
SOC/CRAC polypeptides further embrace functionally equivalent fragments, variants, and 
analogs of the preferred SOC/CRAC polypeptides, provided that the fragments, variants, and 
analogs also are useful in mediating calcium transport into and out of intracellular calcium 
stores. 

As used herein, "SOC/CRAC calcium channel activity" refers to Ca^"^ transport ("Ca^^ 
fluxing") across the plasma membrane that is mediated by a SOC/CRAC calcium channel 
polypeptide. The SOC/CRAC calcium channel polypeptide typically has one or more of the 
following properties: high selectivity, a unitary conductance below the detection level of the 
patch clamp method,and are subject to inhibition by high intracellular calcium levels. Such 
activity can be easily detected using standard methodology well known in the art. See, e.g., 
the Examples and Neher, E., "Ion channels for conmiunication between and within cells". 
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Science. 1992 : 256:498-502; and Hoth, M., and Penner, R., "Depletion of intracellular 
calcium stores activates a calcium current in mast cells". Nature. 1992 : 355 (6358):353-6. 

According to one aspect of the invention, isolated nucleic acid molecules which code 
for one or more member(s) of the SOC/CRAC family of calcium channel polypeptides are 
5 provided. The isolated nucleic acid molecules are selected from the foUov^ing groups: 

(a) nucleic acid molecules which hybridize under stringent conditions to one or more 
nucleic acid molecules selected from the group consisting of SEQ ID NO:l, SEQ ID NO:3, 
SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID 
NO:29, and SEQ ID NO:31, and which code for a SOC/CRAC polypeptide; 
JO (b) deletions, additions and substitutions of (a) which code for a respective 

SOC/CRAC polypeptide; 

(c) nucleic acid molecules that differ from the nucleic acid molecules of (a) or (b) in 
codon sequence due to the degeneracy of the genetic code, and 

(d) complements of (a), (b) or (c). 

75 In certain embodiments, the isolated nucleic acid molecule comprises one or more of 

nucleotides 1-1212 of SEQ IDNO:l; nucleotides 1-739 of SEQ ID NO:3; nucleotides 1-1579 
of SEQ ID NO:5; nucleotides 1-5117 of SEQ ID NO:23; the mouse homolog for SOC- 
2/CRAC-l corresponding to SEQ ID NO:7; nucleotides 1-2180 of SEQ ID NO:25; 
nucleotides 382-5976 of SEQ ID NO:27; nucleotides 73-3714 of SEQ ID NO:29; and 

20 nucleotides 23-3434 of SEQ ID NO:31. In yet other embodiments, the isolated nucleic acid 
molecule comprises a molecule which encodes a polypeptide having one or more sequences 
selected from the group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID 
NO:8, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, and SEQ ID NO:32. 
According to yet another aspect of the invention, an isolated nucleic acid molecule is 

25 provided which is selected from the group consisting of: 

(a) a imique fragment of a nucleic acid molecule selected from the group consisting of 
SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:23, SEQ ID NO:25, 
SEQ ID NO:27, SEQ ID NO:29, and SEQ ID NO:31, (of sufficient length to represent a 
sequence unique within the himian genome); and (b) complements of (a), provided that the 

30 unique fragment includes a sequence of contiguous nucleotides which is not identical to a 
sequence in the prior art as represented by the sequence group consisting of: (1) sequences 
having the SEQ ID NOs or GenBank accession numbers of Table I, (2) complements of (1), 
and (3) fragments of (1) and (2). 
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In some embodiments, the sequence of contiguous nucleotides is selected from the 
group consisting of (1) at least two contiguous nucleotides nonidentical to the sequence 
group, (2) at least three contiguous nucleotides nonidentical to the sequence group, (3) at least 
four contiguous nucleotides nonidentical to the sequence group, (4) at least five contiguous 
nucleotides nonidentical to the sequence group, (5) at least six contiguous nucleotides 
nonidentical to the sequence group, (6) at least seven contiguous nucleotides nonidentical to 
the sequence group. 

In other embodiments, the unique fragment has a size selected from the group 
consisting of at least: 8 nucleotides, 10 nucleotides, 12 nucleotides, 14 nucleotides, 16 
nucleotides, 1 8 nucleotides, 20, nucleotides, 22 nucleotides, 24 nucleotides, 26 nucleotides, 
28 nucleotides, 30 nucleotides, 40 nucleotides, 50 nucleotides, 75 nucleotides, 100 
nucleotides, 200 nucleotides, 1000 nucleotides and every integer length therebetween. 

According to another aspect of the invention, expression vectors and host cells 
containing (e.g., transformed or transfected with) expression vectors comprising the nucleic 
acid molecules disclosed herein operably linked to a promoter are provided. In certain 
preferred embodiments, the host cells are eukaryotic cells. 

The isolated nucleic acid molecules disclosed herein have various utilities, including 
their use as probes and primers to identify additional members of the SOC/CRAC family of 
calcium channels, as diagnostic reagents for identifying the presence of SOC/CRAC 
polypeptides in biological or other samples, and as agents for generating SOC/CRAC binding 
polypeptides (e.g., mtibodies) that can be used as reagents in diagnostic and therapeutic 
assays to identify the presence, absence, and/or amounts of a SOC/CRAC nucleic acid or 
polypeptide in a biological or other sample. 

As used herein with respect to nucleic acids, the term "isolated" means: (i) amplified 
in vitro by, for example, polymerase chain reaction (PCR); (ii) recombinantly produced by 
cloning; (iii) purified, as by cleavage and gel separation; or (iv) synthesized by, for example, 
chemical synthesis. An isolated nucleic acid is one which is readily manipulatable by 
recombinant DNA techniques well known in the art. Thus, a nucleotide sequence contained 
in a vector in which 5* and 3' restriction sites are known or for which polymerase chain 
reaction (PCR) primer sequences have been disclosed is considered isolated but a nucleic acid 
sequence existing in its native state in its natural host is not. An isolated nucleic acid may be 
substantially purified, but need not be. For example, a nucleic acid that is isolated within a 
cloning or expression vector is not pure in that it may comprise only a tiny percentage of the 
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material in the cell in which it resides. Such a nucleic acid is isolated, however, as the term is 
used herein because it is readily manipulatable by standard techniques known to those of 
ordinary skill in the art. 

As used herein with respect to polypeptides (discussed below), the term "isolated" 
means separated from its native environment in sufficiently pure form so that it can be 
manipulated or used for any one of the purposes of the invention. Thus, isolated means 
sufficiently pure to be used (i) to raise and/or isolate antibodies, (ii) as a reagent in an assay, 
or (iii) for sequencing, etc. 

Homologs and alleles of the SOC/CRAC nucleic acids of the invention can be 
identified by conventional techniques. Thus, an aspect of the invention is those nucleic acid 
sequences which code for SOC/CRAC polypeptides and which hybridize to a nucleic acid 
molecule selected from a group consisting of the nucleic acid of SEQ ID NO:l, the nucleic 
acid of SEQ ID NO:3, the nucleic acid of SEQ ID NO:5, the nucleic acid of SEQ ID NO:7, 
the nucleic acid of SEQ ID NO:23, the nucleic acid of SEQ ID NO:25, the nucleic acid of 
SEQ ID NO:27, the nucleic acid of SEQ ID NO:29, and the nucleic acid of SEQ ID NO:31, 
under stringent conditions. The term "stringent conditions" as used herein refers to 
parameters with which the art is familiar. Nucleic acid hybridization parameters may be 
found in references which compile such methods, e.g. Molecular Cloning: A Laboratory 
Manual, J. Sambrook, et al., eds.. Second Edition, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, New York, 1989, or Current Protocols in Molecular Biology ^ F.M. Ausubel, 
et al., eds., John Wiley & Sons, Inc., New York. More specifically, stringent conditions, as 
used herein, refers, for example, to hybridization at 65*'C in hybridization buffer (3.5 x SSC, 
0.02% Ficoll, 0.02% polyvinyl pyrolidone, 0.02% Bovine Serum Albumin, 2.5mM 
NaH2P04(pH7), 0.5% SDS, 2mM EDTA). SSC is 0.1 5M sodium chloride/0. 15M sodium 
citrate, pH7; SDS is sodium dodecyl sulphate; and EDTA is ethylenediaminetetracetic acid. 
After hybridization, the membrane upon which the DNA is transferred is washed at 2 x SSC 
at room temperature and then at 0.1 x SSC/0.1 x SDS at temperatures up to 68°C. 

There are other conditions, reagents, and so forth which can be used, and would result 
in a similar degree of stringency. The skilled artisan will be familiar with such conditions, 
and thus they are not given here. It will be understood, however, that the skilled artisan will 
be able to manipulate the conditions in a manner to permit the clear identification of 
homologs and alleles of the SOC/CRAC nucleic acids of the invention. The skilled artisan 
also is familiar with the methodology for screening cells and libraries for expression of such 
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molecules which then are routinely isolated, followed by isolation of the pertinent nucleic 
acid molecule and sequencing. 

In general homologs and alleles typically will share at least 40% nucleotide identity 
and/or at least 50% amino acid identity to SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ 
ID NO:7, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, and/or SEQ ID 
NO:31, and SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:24, 
SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, and/or SEQ ID NO:32, respectively. In 
some instances sequences will share at least 50% nucleotide identity and/or at least 65% 
amino acid identity and in still other instances sequences will share at least 60% nucleotide 
identity and/or at least 75% amino acid identity. The homology can be calculated using 
various, publicly available software tools developed by NCBI (Bethesda, Maryland) that can 
be obtained through the internet (ftp:/ncbi. nlm.nih.gov/pub/). Exemplary tools include the 
BLAST system available at http://www.ncbi.nlm.nih.gov. Pairwise and ClustalW alignments 
(BLOSUM30 matrix setting) as well as Kyte-Doolittle hydropathic analysis can be obtained 
using the MacVetor sequence analysis software (Oxford Molecular Group). Watson-Crick 
complements of the foregoing nucleic acids also are embraced by the invention. 

In screening for SOC/CRAC related genes, such as homologs and alleles of SOC- 
2/CRAC-l and/or SOC-3/CRAC-2, a Southem blot may be performed using the foregoing 
conditions, together with a radioactive probe. After washing the membrane to which the 
DNA is finally transferred, the membrane can be placed against X-ray film or a 
phosphoimager plate to detect the radioactive signal. 

Given that the expression of the SOC/CRAC gene is prominent in certain himian 
tissues (e.g., SOC-2/CRAC-1: lymphoid tissue/heart, SOC-3/CRAC-2: kidney/colon, SOC- 
4/CRAC-3: prostate), and given the teachings herein of partial human SOC/CRAC cDNA 
clones, full-length and other mammalian sequences corresponding to the human SOC/CRAC 
partial nucleic acid sequences can be isolated from, for example, a cDNA library prepEU'ed 
from one or more of the tissues in which SOC-2/CRAC-1 expression is prominent, SOC- 
3/CRAC-2 is prominent, and/or SOC-4/CRAC-3 expression is prominent, using standard 
colony hybridization techniques. 

The invention also includes degenerate nucleic acids which include alternative codons 
to those present in the native materials. For example, serine residues are encoded by the 
codons TCA, AGT, TCC, TCG, TCT and AGC. Each of the six codons is equivalent for the 
purposes of encoding a serine residue. Thus, it vnll be apparent to one of ordinary skill in the 
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art that any of the serine-encoding nucleotide triplets may be employed to direct the protein 
synthesis apparatus, in vitro or in vivo, to incorporate a serine residue into an elongating 
SOC/CRAC polypeptide. Similarly, nucleotide sequence triplets which encode other amino 
acid residues include, but are not limited to: CCA, CCC, CCG and CCT (proline codons); 
CGA, CGC, COG, CGT, AGA and AGG (arginine codons); ACA, ACC, ACQ and ACT 
(threonine codons); AAC and AAT (asparagine codons); and ATA, ATC and ATT (isoleucine 
codons). Other amino acid residues may be encoded similarly by multiple nucleotide 
sequences. Thus, the invention embraces degenerate nucleic acids that differ from the 
biologically isolated nucleic acids in codon sequence due to the degeneracy of the genetic 
code. 

The invention also provides isolated unique fragments of an isolated nucleic acid 
molecule selected from the group consisting of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, 
SEQ ID NO:7, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, and SEQ ID 
NO:31. A unique fragment is one that is a 'signature^ for the larger nucleic acid. For 
example, the unique fragment is long enough to assure that its precise sequence is not found 
in molecules within the himian genome outside of the SOC/CRAC nucleic acids defined 
above (and hiunan alleles). Those of ordinary skill in the art may apply no more than routine 
procedures to determine if a fragment is imique within the human genome. 

Unique fragments, however, exclude fragments completely composed of the 
nucleotide sequences of any of GenBank accession numbers and SEQ ID NOs listed in Table 
I (SEQ ID NO:9, AB001535, AI226731, H18835, AA419592, AA261842, AA419407, 
AI098310, AA592910, D86107, AF071787, Z77132, Z83117, Z68333, AA708532, 
AA551759, AA932133, R47363, N31660, AC005538, AA654650, AA370110, AA313170, 
AA493512, AI670079, AI671853, AC005538, AA654650, AA370110, AA313170, 
AA493512, AI670079, AJ671853), or other previously published sequences as of the filing 
date of this application. 

A fragment which is completely composed of the sequence described in the foregoing 
GenBank deposits and SEQ ID NO:9, is one which does not include any of the nucleotides 
unique to the sequences of the invention. Thus, a unique fiagment must contain a nucleotide 
sequence other than the exact sequence of those in GenBank or fragments thereof. The 
difference may be an addition, deletion or substitution with respect to the GenBank sequence 
or it may be a sequence wholly separate from the GenBank sequence. 
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Unique fragments can be used as probes in Southern and Northern blot assays to 
identify such nucleic acids, or can be used in amplification assays such as those employing 
PCR. As known to those skilled in the art, large probes such as 200, 250, 300 or more 
nucleotides are preferred for certain uses such as Southern and Northern blots, while smaller 
fragments will be preferred for uses such as PCR. Unique fragments also can be used to 
produce fusion proteins for generating antibodies or determining binding of the polypeptide 
fragments, as demonstrated in the Exsmiples, or for generating immunoassay components. 
Likewise, unique fragments can be employed to produce nonfused fragments of the 
SOC/CRAC polypeptides, useful, for example, in the preparation of antibodies, 
inmiunoassays or therapeutic applications. Unique fragments further can be used as antisense 
molecules to inhibit the expression of SOC/CRAC nucleic acids and polypeptides, 
respectively. 

As will be recognized by those skilled in the art, the size of the unique frs^ment will 
depend upon its conservancy in the genetic code. Thus, some regions of SEQ ID NO:l, SEQ 
ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, 
SEQ ID NO:29, and SEQ ID NO:31, and complements thereof, will require longer segments 
to be unique while others will require only short segments, typically between 12 and 32 
nucleotides long (e.g. 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 
31 and 32 bases) or more, up to the entire length of the disclosed sequence. As mentioned 
above, this disclosure intends to embrace each and every fragment of each sequence, 
beginning at the first nucleotide, the second nucleotide and so on, up to 8 nucleotides short of 
the end, and ending anywhere from nucleotide niunber 8, 9, 10 and so on for each sequence, 
up to the very last nucleotide, (provided the sequence is imique as described above). Virtually 
any segment of the region of SEQ ID NO:l beginning at nucleotide 1 and ending at 
nucleotide 1212, or SEQ ID NO:3 beginning at nucleotide 1 and ending at nucleotide 739, or 
SEQ ID NO:5 beginning at nucleotide 1 and ending at nucleotide 1579, or SEQ ID NO:7 
beginning at nucleotide 1 and ending at nucleotide 3532, or SEQ ID NO:23 beginning at 
nucleotide 1 and ending at nucleotide 5117, SEQ ID NO:25 beginning at nucleotide 1 and 
ending at nucleotide 21 80, SEQ ID NO:27 beginning at nucleotide 1 and ending at nucleotide 
7419, or SEQ ID NO:29 beginning at nucleotide 1 and ending at nucleotide 4061, or SEQ ID 
NO:31 beginning at nucleotide 1 and ending at nucleotide 4646, or complements thereof, that 
is 20 or more nucleotides in length will be unique. Those skilled in the art are well versed in 
methods for selecting such sequences, typically on the basis of the ability of the unique 
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fragment to selectively distinguish the sequence of interest from other sequences in the human 
genome of the fragment to those on known databases typiceilly is all that is necessary, 
although in vitro confirmatory hybridization and sequencing analysis may be performed. 

As mentioned above, the invention embraces antisense oligonucleotides that 
selectively bind to a nucleic acid molecule encoding a SOC/CRAC polypeptide, to decrease 
SOC/CRAC calcium channel activity. When using antisense preparations of the invention, 
slow intravenous administration is preferred. 

As used herein, the term "antisense oligonucleotide" or "antisense" describes an 
oligonucleotide that is an oiigoribonucleotide, oligodeoxyribonucleotide, modified 
oligoribonucleotide, or modified oligodeoxyribonucleotide which hybridizes under 
physiological conditions to DNA comprising a particular gene or to an mRNA transcript of 
that gene and, thereby, inhibits the transcription of that gene and/or the translation of that 
mRNA. The antisense molecules are designed so as to interfere with transcription or 
translation of a target gene upon hybridization with the target gene or transcript. Those 
skilled in the art will recognize that the exact length of the antisense oligonucleotide and its 
degree of complementarity with its target will depend upon the specific target selected, 
including the sequence of the target and the particular bases which comprise that sequence. It 
is preferred that the antisense oligonucleotide be constructed and arranged so as to bind 
selectively with the target under physiological conditions, i.e., to hybridize substantially more 
to the target sequence than to any other sequence in the target cell under physiological 
conditions. Based upon SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID 
NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, and SEQ ID NO:31, or upon allelic 
or homologous genomic and/or cDNA sequences, one of skill in the art can easily choose and 
synthesize any of a number of appropriate antisense molecules for use in accordance with the 
present invention. In order to be sufficiently selective and potent for inhibition, such 
antisense oligonucleotides should comprise at least 10 and, more preferably, at least 15 
consecutive bases which are complementary to the target, although in certain cases modified 
oligonucleotides as short as 7 bases in length have been used successfixlly as antisense 
oligonucleotides (Wagner et al., NaL Med. 1(1 1):1 116-1 118, 1995). Most preferably, the 
antisense oligonucleotides comprise a complementary sequence of 20-30 bases. Although 
oligonucleotides may be chosen which are antisense to any region of the gene or mRNA 
transcripts, in preferred embodiments the antisense oligonucleotides correspond to N-terminal 
or 5' upstream sites such as translation initiation, transcription initiation or promoter sites. In 
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addition, 3 '-untranslated regions may be targeted by antisense oligonucleotides. Targeting to 
mRNA splicing sites has also been used in the art but may be less preferred if alternative 
mRNA splicing occurs. In addition, the antisense is targeted, preferably, to sites in which 
mRNA secondary structure is not expected (see, e.g., Sainio et ai.. Cell Mol. Neurobiol. 
14(5):439-457, 1994) and at which proteins are not expected to bind. Finally, although, SEQ 
ID No:l discloses a cDNA sequence, one of ordinary skill in the art may easily derive the 
genomic DNA corresponding to this sequence. Thus, the present invention also provides for 
antisense oligonucleotides which are complementary to the genomic DNA corresponding to 
SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5,SEQ ID NO:7, SEQ ID NO:23, SEQ ID NO:25, 
SEQ ID NO:27, SEQ ID NO:29, and SEQ ID NO:31. Similarly, antisense to allelic or 
homologous SOC/CRAC cDNAs and genomic DNAs are enabled without undue 
experimentation. 

In one set of embodiments, the antisense oligonucleotides of the invention may be 
composed of "natural*' deoxyribonucleotides, ribonucleotides, or any combination thereof. 
That is, the 5' end of one native nucleotide and the 3' end of another native nucleotide may be 
covalently linked, as in natural systems, via a phosphodiester intemucleoside linkage. These 
oligonucleotides may be prepared by art recognized methods which may be carried out 
manually or by an automated synthesizer. They also may be produced recombinantly by 
vectors. 

In preferred embodiments, however, the antisense oligonucleotides of the invention 
also may include "modified" oligonucleotides. That is, the oligonucleotides may be modified 
in a number of ways which do not prevent them from hybridizing to their target but which 
enhance their stability or targeting or which otherwise enhance their therapeutic effectiveness. 

The term "modified oligonucleotide" as used herein describes an oligonucleotide in 
which (1) at least two of its nucleotides are covalently linked via a synthetic intemucleoside 
linkage (i.e., a linkage other than a phosphodiester linkage between the 5' end of one 
nucleotide and the 3' end of another nucleotide) and/or (2) a chemical group not normally 
associated with nucleic acids has been covalently attached to the oligonucleotide. Preferred 
synthetic intemucleoside linkages are phosphorothioates, alkylphosphonates, 
phosphorodithioates, phosphate esters, aUcylphosphonothioates, phosphoramidates, 
carbamates, carbonates, phosphate triesters, acetamidates, carboxymethyl esters and peptides. 

The term "modified oligonucleotide" also encompasses oligonucleotides with a 
covalently modified base and/or sugar. For example, modified oligonucleotides include 
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oligonucleotides having backbone sugars which are covalently attached to low molecular 
weight organic groups other than a hydroxyl group at the 3' position and other than a 
phosphate group at the 5' position. Thus modified oligonucleotides may include a 2'-0- 
alkylated ribose group. In addition, modified oligonucleotides may include sugars such as 
arabinose instead of ribose. The present invention, thus, contemplates pharmaceutical 
preparations containing modified antisense molecules that are complementary to and 
hybridizable with, under physiological conditions, nucleic acids encoding SOC/CRAC 
polypeptides, together with pharmaceutically acceptable carriers. Antisense oligonucleotides 
may be administered as part of a pharmaceutical composition. Such a pharmaceutical 
composition may include the antisense oligonucleotides in combination with any standard 
physiologically and/or pharmaceutically acceptable carriers which are known in the art. The 
compositions should be sterile and contain a therapeutically effective amount of the antisense 
oligonucleotides in a unit of weight or volume suitable for administmtion to a patient. The 
term "pharmaceutically acceptable" means a non-toxic material that does not interfere with 
the effectiveness of the biological activity of tiie active ingredients. The term 
"physiologically acceptable" refers to a non-toxic material that is compatible with a biological 
system such as a cell, cell culture, tissue, or organism. The characteristics of the carrier will 
depend on the route of administration. Physiologically and pharmaceutically acceptable 
carriers include diluents, fillers, salts, buffers, stabilizers, solubilizers, and other materials 
which are well known in the art. 

The invention also involves expression vectors coding for SOC/CRAC proteins and 
fragments and variants thereof and host cells containing those expression vectors. Virtually 
any cells, prokaryotic or eukaryotic, which can be transformed with heterologous DNA or 
RNA and which can be grown or maintained in culture, may be used in the practice of the 
invention. Examples include bacterial cells such as E.coli and eukaryotic cells such as mouse, 
hamster, pig, goat, primate, yeast, xenopous, etc. They may be of a wide variety of tissue 
types, including mast cells, fibroblasts, oocytes and lymphocytes, and they may be primary 
cells or cell lines. Specific examples include CHO cells and COS cells. Cell-fi'ee 
transcription systems also may be used in lieu of cells. 

As used herein, a "vector" may be any of a number of nucleic acids into which a 
desired sequence may be inserted by restriction and ligation for transport between different 
genetic environments or for expression in a host cell. Vectors are typically composed of 
DNA although RNA vectors are also available. Vectors include, but are not limited to. 
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plasmids, phagemids and virus genomes. A cloning vector is one which is able to replicate in 
a host cell, and which is further characterized by one or more endonuclease restriction sites at 
which the vector may be cut in a determinable fashion and into which a desired DNA 
sequence may be ligated such that the new recombinant vector retains its ability to replicate in 
the host cell. In the case of plasmids, replication of the desired sequence may occur many 
times as the plasmid increases in copy number within the host bacterium or just a single time 
per host before the host reproduces by mitosis. In the case of phage, replication may occur 
actively during a lytic phase or passively during a lysogenic phase. An expression vector is 
one into which a desired DNA sequence may be inserted by restriction and ligation such that 
it is operably joined to regulatory sequences and may be expressed as an RNA transcript. 
Vectors may further contain one or more marker sequences suitable for use in the 
identification of cells which have or have not been transformed or transfected with the vector. 
Markers include, for example, genes encoding proteins which increase or decrease either 
resistance or sensitivity to antibiotics or other compounds, genes which encode enzymes 
whose activities are detectable by standard assays known in the art (e.g., p-galactosidase or 
alkaline phosphatase), and genes which visibly affect the phenotype of transformed or 
transfected cells, hosts, colonies or plaques (e.g., green fluorescent protein). Preferred vectors 
are those capable of autonomous replication and expression of the structural gene products 
present in the DNA segments to which they are operably joined. 

As used herein, a coding sequence and regulatory sequences are said to be "operably" 
joined when they are covalently linked in such a way as to place the expression or 
transcription of the coding sequence under the influence or control of the regulatory 
sequences. If it is desired that the coding sequences be translated into a functional protein, 
two DNA sequences are said to be operably joined if induction of a promoter in the 5' 
regulatory sequences results in the transcription of the coding sequence and if the nature of 
the linkage between the two DNA sequences does not (1) result in the introduction of a frame- 
shift mutation, (2) interfere with the ability of the promoter region to direct the transcription 
of the coding sequences, or (3) interfere with the ability of the corresponding RNA transcript 
to be translated into a protein. Thus, a promoter region would be operably joined to a coding 
sequence if the promoter region were capable of effecting transcription of that DNA sequence 
such that the resulting transcript might be translated into the desired protein or polypeptide. 

The precise nature of the regtdatory sequences needed for gene expression may vary 
between species or cell types, but shall in general include, as necessary, 5' non-transcribed 
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and 5' non-translated sequences involved with the initiation of transcription and translation 
respectively, such as a TATA box, capping sequence, CAAT sequence, and the like. 
Especially, such 5' non-transcribed regulatory sequences v^U include a promoter region 
which includes a promoter sequence for transcriptional control of the operably joined gene. 
Regulatory sequences may also include enhancer sequences or upstream activator sequences 
as desired. The vectors of the invention may optionally include 5' leader or signal sequences. 
The choice and design of an appropriate vector is within the ability and discretion of one of 
ordinary skill in the art. 

According to yet another aspect of the invention, isolated SOC/CRAC polypeptides 
are provided. Preferably, the isolated SOC/CRAC polypeptides are encoded by the isolated 
SOC/CRAC nucleic acid molecules disclosed herein. More preferably, the isolated 
SOC/CRAC polypeptides of the invention are encoded by the nucleic acid molecules having 
SEQ ID Nos. 1, 3, 5, 7, 23, 25, 27, 29, and 31. In yet other embodiments, the isolated 
SOC/CRAC polypeptides of the invention have an amino acid sequence selected from the 
group consisting of SEQ ID Nos. 2, 4, 6, 8, 24, 26, 28, 30 and 32. Preferably, the isolated 
SOC/CRAC polypeptides are of sufficient length to represent a sequence unique within the 
hmnan genome. Thus, the preferred embodiments include a sequence of contiguous amino 
acids which is not identical to a prior art sequence as represented by the sequence group 
consisting of the contiguous amino acids identified in Table II (SEQ ID NO: 10, SEQ ID 
NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:19 and GenBank Acc. 
Nos. AB001535, AA592910, D86107, AF071787, Z77132, Z83117, Z68333, AA708532, 
AA551759, AA932133, R47363, N31660, NP003298, CAB00861, NP002411, CAA92726, 
CAB05572). 

In certain embodiments, the isolated SOC/CRAC polypeptides are immunogenic and 
can be used to generate binding polypeptides (e.g., antibodies) for use in diagnostic and 
therapeutic applications. Such binding polypeptides also are useful for detecting the presence, 
absence, and/or amounts of a SOC/CRAC nucleic acid or polypeptide in a sample such as a 
biological fluid or biopsy sample. Preferably, the SOC/CRAC polypeptides that are useful for 
generating binding polypeptides are unique polypeptides and, therefore, binding of the 
antibody to a SOC/CRAC polypeptide in a sample is selective for the SOC/CRAC 
polypeptide. 

Expression vectors containing all the necessary elements for expression are 
commercially available and known to those skilled in the art. See, e.g., Sambrook et al.. 
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Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor Laboratory 
Press, 1989. Cells are genetically engineered by the introduction into the cells of 
heterologous DNA (RNA) encoding a SOC/CRAC polypeptide or fragment or variant thereof. 
The heterologous DNA (RNA) is placed under operable control of transcriptional elements to 
permit the expression of the heterologous DNA in the host cell. 

Preferred systems for mRNA expression in mammalian cells are those such as 
pRc/CMV (available from Invitrogen, Carlsbad, CA) that contain a selectable marker such as 
a gene that confers G418 resistance (which facilitates the selection of stably transfected cell 
lines) and the himian cytomegalovirus (CMV) enhancer-promoter sequences. Additionally, 
suitable for expression in primate or canine cell lines is the pCEP4 vector (Invitrogen, 
Carlsbad, CA), v^hich contains an Epstein Barr virus (EBV) origin of replication, facilitating 
the maintenance of plasmid as a multicopy extrachromosomal element. Another expression 
vector is the pEF-BQS plasmid containing the promoter of polypeptide Elongation Factor la, 
which stimulates efficiently transcription in vitro. The plasmid is described by Mishizuma 
and Nagata (Nuc. Acids Res. 18:5322, 1990), and its use in transfection experiments is 
disclosed by, for example, Demoulin (Afo/. Cell. BioL 16:4710-4716, 1996). Still another 
preferred expression vector is an adenovirus, described by Stratford-Perricaudet, which is 
defective for El and E3 proteins {J. Clin. Invest. 90:626-630, 1992). The use of the 
adenovirus as an Adeno.PlA recombinant is disclosed by Warmer et al., in intradermal 
injection in mice for immunization against PI A (Jnt. J. Cancer, 67:303-310, 1996). 

The invention also embraces so-called expression kits, which allow the artisan to 
prepare a desired expression vector or vectors. Such expression kits include at least separate 
portions of each of the previously discussed coding sequences. Other components may be 
added, as desired, as long as the previously mentioned sequences, which are required, are 
included. 

It will also be recognized that the invention embraces the use of the above described, 
SOC/CRAC cDNA sequence containing expression vectors, to transfect host cells and cell 
lines, by these prokaryotic (e.g., E. coli) , or eukaryotic (e.g., CHO cells, COS cells, yeast 
expression systems and recombinant baculovirus expression in insect cells). Especi£dly useful 
are mammalian cells such as mouse, hamster, pig, goat, primate, etc. They may be of a wide 
variety of tissue types, and include primary cells and cell lines. Specific examples include 
dendritic cells, U293 cells, peripheral blood leukocytes, bone marrow stem cells and 
embryonic stem cells. The invention also permits the construction of SOC/CRAC gene 
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"knock-outs'' in cells and in animals, providing materials for studying certain aspects of 
SOC/CRAC calcium channel activity. 

The invention also provides isolated polypeptides (including whole proteins and 
partial proteins), encoded by the foregoing SOC/CRAC nucleic acids, and include the 
polypeptides of SEQ ID NO:2, 4, 6, 8, 24, 26, 28, 30, 32, and unique fragments thereof. Such 
polypeptides are useful, for example, to regulate calcium transport-mediated cell growth, 
differentiation and proliferation, to generate antibodies, as components of immunoassays, etc. 
Polypeptides can be isolated from biological samples including tissue or cell homogenates, 
and can also be expressed recombinantly in a variety of prokaryotic and eukaryotic expression 
systems by constructing an expression vector appropriate to the expression system, 
introducing the expression vector into the expression system, and isolating the recombinantly 
expressed protein. Short polypeptides, including antigenic peptides (such as are presented by 
MHC molecules on the surface of a cell for immune recognition) also can be synthesized 
chemically using well-established methods of peptide synthesis. 

A unique fragment of a SOC/CRAC polypeptide, in general, has the features and 
characteristics of unique fragments as discussed above in connection with nucleic acids. As 
will be recognized by those skilled in the art, the size of the unique fragment will depend 
upon factors such as whether the fragment constitutes a portion of a conserved protein 
domain. Thus, some regions of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, 
SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, and/or SEQ ID NO:32, will 
require longer segments to be unique while others will require only short segments, typically 
between 5 and 12 amino acids (e.g. 5, 6, 7, 8, 9, 10, 11 and 12 amino acids long or more, 
including each integer up to the full length, > 1,000 amino acids long). Virtually any segment 
of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:24, SEQ ID 
NO:26, SEQ ID NO:28, SEQ ID NO:30, and/or SEQ ID NO:32, excluding the ones that share 
identity witii it (the polypeptides identified in Table II - SEQ ID NO: 10, SEQ ID NO: 13, SEQ 
ID NO:14, SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:19, and GenBank Acc. Nos. 
AB001535, AA592910, D86107, AF071787, Z77132, Z83117, Z68333, AA708532, 
AA551759, AA932133, R47363, N31660, NP003298, CAB00861, NP002411, CAA92726, 
CAB0SS72) that is 9 or more amino acids in length will be unique. 

Unique fragments of a polypeptide preferably are those fragments which retain a 
distinct functional capability of the polypeptide. Functional capabilities which can be retained 
in a unique fragment of a polypeptide include Ca^^ fluxing, high selectivity, a unitary 
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conductance below the detection level of the patch clamp method, and/or and are subject to 
inhibition by high intracellular calcium levels. 

One important aspect of a unique fragment is its ability to act as a signature for 
identifying the polypeptide. Optionally, another aspect of a unique fragment is its ability to 
provide an immune response in an animal. Those skilled in the art are well versed in methods 
for selecting unique amino acid sequences, typically on the basis of the ability of the unique 
fragment to selectively distinguish the sequence of interest from non-family members. A 
comparison of the sequence of the fragment to those on known databases typically is all that 
is necessary. 

The invention embraces variants of the SOC/CRAC polypeptides described above. As 
used herein, a "variant" of a SOC/CRAC polypeptide is a polypeptide which contains one or 
more modifications to the primary amino acid sequence of a SOC/CRAC polypeptide. 
Modifications which create a SOC/CRAC polypeptide variant are typically made to the 
nucleic acid which encodes the SOC/CRAC polypeptide, and can include deletions, point 
mutations, truncations, amino acid substitutions and addition of amino acids or non-amino 
acid moieties to: 1) reduce or eliminate a calcium channel activity of a SOC/CRAC 
polypeptide; 2) enhance a property of a SOC/CRAC polypeptide, such as protein stability in 
an expression system or the stability of protein-protein binding; 3) provide a novel activity or 
property to a SOC/CRAC polypeptide, such as addition of an antigenic epitope or addition of 
a detectable moiety; or 4) to provide equivalent or better binding to a SOC/CRAC polypeptide 
receptor or other molecule. Alternatively, modifications can be made directly to the 
polypeptide, such as by cleavage, addition of a linker molecule, addition of a detectable 
moiety, such as biotin, addition of a fatty acid, and the like. Modifications also embrace 
fusion proteins comprising all or part of the SOC/CRAC amino acid sequence. One of skill in 
the art will be familiar with methods for predicting the effect on protein conformation of a 
change in protein sequence, and can thus "design" a variant SOC/CRAC polypeptide 
according to known methods. One example of such a method is described by Dahiyat and 
Mayo in Science 278:82-87, 1997, whereby proteins can be designed de novo. The method 
can be applied to a known protein to vary only a portion of the polypeptide sequence. By 
applying the computational methods of Dahiyat and Mayo, specific variants of a SOC/CRAC 
calcium channel polypeptide can be proposed and tested to determine whether the variant 
retains a desired conformation. 
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Variants can include SOC/CRAC polypeptides which are modified specifically to alter 
a feature of the polypeptide unrelated to its physiological activity. For example, cysteine 
residues can be substituted or deleted to prevent unwanted disulfide linkages. Similarly, 
certain amino acids can be changed to enhance expression of a SOC/CRAC polypeptide by 
eliminating proteolysis by proteases in an expression system (e.g., dibasic amino acid residues 
in yeast expression systems in which KEX2 protease activity is present). 

Mutations of a nucleic acid which encodes a SOC/CRAC polypeptide preferably 
preserve the amino acid reading frame of the coding sequence and, preferably, do not create 
regions in the nucleic acid which are likely to hybridize to form secondary structures, such as 
hairpins or loops, which can be deleterious to expression of the variant polypeptide. 

Mutations can be made by selecting an amino acid substitution, or by random 
mutagenesis of a selected site in a nucleic acid which encodes the polypeptide. Variant 
polypeptides are then expressed and tested for one or more activities to determine which 
mutation provides a variant polypeptide with the desired properties. Further mutations can be 
made to variants (or to non-variant SOC/CRAC polypeptides) which are silent as to the amino 
acid sequence of the polypeptide, but which provide preferred codons for translation in a 
particular host. The preferred codons for translation of a nucleic acid in, e.g., E. coli. are well 
known to those of ordinary skill in the art. Still other mutations can be made to the noncoding 
sequences of a SOC/CRAC gene or cDNA clone to enhance expression of the polypeptide. 

The skilled artisan will realize that conservative amino acid substitutions may be made 
in SOC/CRAC polypeptides to provide functionally equivalent variants of the foregoing 
polypeptides, i.e, the variants retain the functional capabilities of the SOC/CRAC 
polypeptides. As used herein, a "conservative amino acid substitution" refers to an amino 
acid substitution which does not alter the relative charge or size characteristics of the protein 
in which the amino acid substitution is made. Variants can be prepared according to methods 
for altering polypeptide sequence known to one of ordinary skill in the art such as are found in 
references which compile such methods, e.g. Molecular Cloning: A Laboratory Manual^ J. 
Sambrook, et al., eds.. Second Edition, Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, New York, 1989, or Current Protocols in Molecular Biology, F.M. Ausubel, et al., 
eds., John Wiley & Sons, Inc., New York. Exemplary functionally equivalent variants of the 
SOC/CRAC polypeptides include conservative amino acid substitutions of SEQ ID NO:2, 
SEQ ID NO:4, SEQ ID N0:6, SEQ ID N0:8, SEQ ID NO:24, SEQ ID NO:26, SEQ ID 
NO:28, SEQ ID NO:30, and/or SEQ ID NO:32. Conservative substitutions of amino acids 
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include substitutions made amongst amino acids withm the following groups: (a) M, I, L, V; 
(b) F, Y, W; (c) K, R, H; (d) A, G; (e) S, T; (f) Q, N; and (g) E. D. 

Thus functionally equivalent variants of SOC/CRAC polypeptides, i.e., variants of 
SOC/CRAC polypeptides which retain the function of the natural SOC/CRAC polypeptides, 
are contemplated by the invention. Conservative amino-acid substitutions in the amino acid 
sequence of SOC/CRAC polypeptides to produce functionally equivalent variants of 
SOC/CRAC polypeptides typically are made by alteration of a nucleic acid encoding 
SOC/CRAC polypeptides (e.g., SEQ ID NOs:l, 3, 5, 7, 23, 25, 27, 29, 31). Such substitutions 
can be made by a variety of methods known to one of ordinary skill in the art. For example, 
amino acid substitutions may be made by PCR-directed mutation, site-directed mutagenesis 
according to the method of Kunkel (Kvinkel, Proc. Nat. Acad. Sci. U.S.A. 82: 488-492, 1985), 
or by chemical synthesis of a gene encoding a SOC/CRAC polypeptide. The activity of 
functionally equivalent fiagments of SOC/CRAC polypeptides can be tested by cloning the 
gene encoding the altered SOC/CRAC polypeptide into a bacterial or mammalian expression 
vector, introducing the vector into an appropriate host cell, expressing the altered SOC/CRAC 
polypeptide, and testing for a functional capability of the SOC/CRAC polypeptides as 
disclosed herein (e.g., SOC/CRAC calcium channel activity). 

The invention as described herein has a number of uses, some of which are described 
elsewhere herein. First, the invention permits isolation of SOC/CRAC polypeptides, 
including the isolation of the complete SOC/CRAC polypeptide. A variety of methodologies 
well-known to the skilled practitioner can be utilized to obtain isolated SOC/CRAC 
molecules. The polypeptide may be purified from cells which naturally produce the 
polypeptide by chromatographic means or immunological recognition. Alternatively, an 
expression vector may be introduced into cells to cause production of the polypeptide. In 
another method, mRNA transcripts may be microinjected or otherwise introduced into cells to 
cause production of the encoded polypeptide. Translation of SOC/CRAC mRNA in cell-free 
extracts such as the reticulocyte lysate system also may be used to produce SOC/CRAC 
polypeptides. Those skilled in the art also can readily follow known methods for isolating 
SOC/CRAC polypeptides. These include, but are not limited to, immxmochromatography, 
HPLC, size-exclusion chromatography, ion-exchange chromatography and immune-affinity 
chromatography. 

The invention also provides, in certain embodiments, "dominant negative" 
polypeptides derived from SOC/CRAC polypeptides. A dominant negative polypeptide is an 
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inactive variant of a protein, which, by interacting with the cellular machinery, displaces an 
active protein from its interaction with the cellular machinery or competes with the active 
protein, thereby reducing the effect of the active protein. For example, a dominant negative 
receptor which binds a ligand but does not transmit a signal in response to binding of the 
ligand can reduce the biological effect of expression of the ligand. Likewise, a dominant 
negative inactive SOC/CRAC calcium channel which interacts normally with the cell 
membrane but which does not mediate calcium transport can reduce calcium transport in a 
cell. Similarly, a dominant negative transcription factor which binds to a promoter site in the 
control region of a gene but does not increase gene transcription can reduce the effect of a 
normal transcription factor by occupying promoter binding sites without increasing 
transcription. 

The end result of the expression of a dominant negative polypeptide in a cell is a 
reduction in function of active proteins. One of ordinary skill in the art can assess the 
potential for a dominant negative variant of a protein, and using standard mutagenesis 
techniques to create one or more dominant negative variant polypeptides. See, e.g., U.S. 
Patent No. 5,580,723 and Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual^ 
Second Edition, Cold Spring Harbor Laboratory Press. The skilled artisan then can test the 
population of mutagenized polypeptides for diminution in a selected and/or for retention of 
such an activity. Other similar methods for creating and testing dominant negative variants of 
a protein will be apparent to one of ordinary skill in the art. 

According to another aspect, the invention provides a method for isolating a 
SOC/CRAC molecule having SOC/CRAC calcium channel activity. The method involves 
contacting a binding molecule that is a SOC/CRAC nucleic acid or a SOC/CRAC binding 
polypeptide with a sample containing one or more SOC/CRAC molecules under conditions 
that gdlow such binding (see earlier discussion) to form a complex, detecting the presence of 
the complex, isolating the SOC/CRAC molecule from the complex, and determining whether 
the isolated SOC/CRAC molecule has SOC/CRAC calcixmi channel activity. Thus, the 
invention is useful for identifying and isolating full length complementary (cDNA) or 
genomic nucleic acids encoding SOC/CRAC polypeptides having SOC/CRAC calcium 
chaimel activity. Identification and isolation of such nucleic acids and polypeptides may be 
accomplished by hybridizing/binding, under appropriate conditions well known in the art, 
libraries sind/or restriction enzyme-digested human nucleic acids, with a labeled SOC/CRAC 
molecular probe. As used herein, a "label" includes molecules that are incorporated into, for 
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example, a SOC/CRAC molecule (nucleic acid or peptide), that can be directly or indirectly 
detected. A wide variety of detectable labels are well known in the art that can be used, and 
include labels that provide direct detection (e.g., radioactivity, luminescence, optical or 
electron density, etc), or indirect detection (e.g., epitope tag such as the FLAG epitope, 
enzyme tag such as horseseradish peroxidase, etc.). The label may be bound to a SOC/CRAC 
binding partner, or incorporated into the structure of the binding partner. 

A variety of methods may be used to detect the label, depending on the nature of the 
label and other assay components. For example, the label may be detected while bound to the 
solid substrate or subsequent to separation from the solid substrate. Labels may be directly 
detected through optical or electron density, radioactive emissions, nonradioactive energy 
transfers, etc. or indirectly detected vnth antibody conjugates, strepavidin-biotin conjugates, 
etc. Methods for detecting the labels are well known in the art. Once a library clone or 
hybridizing fragment is identified in the hybridization/binding reaction, it can be further 
isolated by employing standard isolation/cloning techniques known to those of skill in the art. 
See, generally, Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual, 2nd 
Edition, Cold Spring Harbor Laboratory Press. In addition, nucleic acid amplification 
techniques well known in the art, may also be used to locate splice variants of calcium 
channel (or calcium channel subunits) with SOC/CRAC calcium channel activity. Size and 
sequence determinations of the amplification products can reveal splice variants. 

The foregoing isolated nucleic acids and polypeptides may then be compared to the 
nucleic acids and polypeptides of the present invention in order to identify homogeneity or 
divergence of the sequences, and be further characterized functionally to determine whether 
they belong to a family of molecules with SOC/CRAC calcium channel activity (for 
methodology see under the Examples section). 

The isolation of the SOC/CRAC cDNA and/or partial sequences thereof also makes it 
possible for the artisan to diagnose a disorder characterized by an aberrant expression of 
SOC/CRAC. These methods involve determining expression of the SOC/CRAC gene, and/or 
SOC/CRAC polypeptides derived therefrom. In the former situation, such determinations can 
be carried out via any standard nucleic acid determination assay, including the polymerase 
chain reaction, or assaying with labeled hybridization probes as exemplified below. In the 
latter situation, such determination can be carried out via any standard inunimological assay 
using, for example, antibodies which bind to the SOC/CRAC protein. 
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The invention also embraces isolated peptide binding agents which, for example, can 
be antibodies or fragments of antibodies ("binding polypeptides"), having the ability to 
selectively bind to SOC/CRAC polypeptides. Antibodies include polyclonal and monoclonal 
antibodies, prepared according to conventional methodology. In certain embodiments, the 
invention excludes binding agents (e.g., antibodies) that bind to the polypeptides encoded by 
the nucleic acids of SEQ ID NOs: 10, 12, 13, 14, 15, 17, and 19. 

Significantly, as is well-known in the art, only a small portion of an antibody 
molecule, the paratope, is involved in the binding of the antibody to its epitope (see, in 
general, Clark, W.R. (1986) The Experimental Foimdations of Modem Immunology Wiley & 
Sons, Inc., New York; Roitt, L (1991) Essentiztl Immunology. 7th Ed., Blackwell Scientific 
Publications, Oxford). The pFc' and Fc regions, for example, are effectors of the complement 
cascade but are not involved in antigen binding. An antibody from which the pFc' region has 
been enzymatically cleaved, or which has been produced without the pFc' region, designated 
an F(ab')2 fragment, retains both of the antigen binding sites of an intact antibody. Similarly, 
an antibody firom which the Fc region has been enzymatically cleaved, or which has been 
produced without the Fc region, designated an Fab fi-agment, retains one of the antigen 
binding sites of an intact antibody molecule. Proceeding further. Fab fragments consist of a 
covalently bound antibody light chain and a portion of the antibody heavy chain denoted Fd. 
The Fd fragments are the major determinant of antibody specificity (a single Fd fragment may 
be associated with up to ten different light chains without altering antibody specificity) and Fd 
fragments retain epitope-binding ability in isolation. 

Within the antigen-binding portion of an antibody, as is well-known in the art, there 
are complementarity determining regions (CDRs), which directly interact with the epitope of 
the antigen, and framework regions (FRs), which maintain the tertiary structure of the 
paratope (see, in general, Clark, 1986; Roitt, 1991). In both the heavy chain Fd fragment and 
the light chain of IgG immunoglobulins, there are four framework regions (FRl through FR4) 
separated respectively by three complementarity determining regions (CDRl through CDR3). 
The CDRs, and in particular the CDR3 regions, and more particularly the heavy chain CDR3, 
are largely responsible for antibody specificity. 

It is now well-established in the art that the non-CDR regions of a mammalian 
antibody may be replaced with similar regions of conspecific or heterospecific antibodies 
v^le retaining the epitopic specificity of the original antibody. This is most clearly 
manifested in the development and use of "himianized" antibodies in which non-human CDRs 
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are covalently joined to human FR and/or Fc/pFc' regions to produce a functional antibody. 
Thxis, for example, PCX International Publication Number WO 92/04381 teaches the 
production and use of humanized murine RSV antibodies in which at least a portion of the 
murine FR regions have been replaced by FR regions of human origin. Such antibodies, 
including fragments of intact antibodies with antigen-binding ability, are often referred to as 
"chimeric" antibodies. 

Thus, as will be apparent to one of ordinary skill in the art, the present invention also 
provides for F(ab')2, Fab, Fv and Fd fragments; chimeric antibodies in which the Fc and/or 
FR and/or CDRl and/or CDR2 and/or light chain CDR3 regions have been replaced by 
homologous human or non-human sequences; chimeric F(ab')2 fragment antibodies in which 
the FR and/or CDRl and/or CDRl and/or light chain CDR3 regions have been replaced by 
homologous human or non-human sequences; chimeric Fab fragment antibodies in which the 
FR and/or CDRl and/or CDR2 and/or light chain CDR3 regions have been replaced by 
homologous human or non-human sequences; and chimeric Fd fragment antibodies in which 
the FR and/or CDRl and/or CDR2 regions have been replaced by homologous human or non- 
human sequences. The present invention also includes so-called single chain antibodies. 

Thus, the invention involves binding pol3^ptides of numerous size and type that bind 
selectively to SOC/CRAC polypeptides, and complexes containing SOC/CRAC polypeptides. 
These binding polypeptides also may be derived also from sources other than antibody 
technology. For example, such polypeptide binding agents can be provided by degenerate 
peptide libraries which can be readily prepared in solution, in inmiobilized form, as bacterial 
flagella peptide display libraries or as phage display libraries. Combinatorial libraries also 
can be synthesized of peptides containing one or more amino acids. Libraries further can be 
synthesized of peptides and non-peptide synthetic moieties. 

Phage display can be particularly effective in identifying binding peptides useful 
according to the invention. Briefly, one prepares a phage library (using e.g. ml 3, fd, or 
lambda phage), displaying inserts from 4 to about 80 amino acid residues using conventional 
procedures. The inserts may represent, for example, a completely degenerate or biased array. 
One then can select phage-bearing inserts which bind to the SOC/CRAC polypeptide or a 
complex containing a SOC/CRAC polypeptide. This process can be repeated through several 
cycles of reselection of phage that bind to the SOC/CRAC polypeptide or complex. Repeated 
rounds lead to enrichment of phage bearing particular sequences. DNA sequence analysis can 
be conducted to identify the sequences of the expressed polypeptides. The minimal linear 
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portion of the sequence that binds to the SOC/CRAC polypeptide or complex can be 
determined. One can repeat the procedure using a biased library containing inserts containing 
part or all of the minimal linear portion plus one or more additional degenerate residues 
upstream or downstream thereof. Yeast two-hybrid screening methods also may be used to 
identify polypeptides that bind to the SOC/CRAC polypeptides. Thus, the SOC/CRAC 
polypeptides of the invention, or a fragment thereof, or complexes of SOC/CRAC can be used 
to screen peptide libraries, including phage display libraries, to identify and select peptide 
binding polypeptides that selectively bind to the SOC/CRAC polypeptides of the invention. 
Such molecules can be used, as described, for screening assays, for purification protocols, for 
interfering directly with the functioning of SOC/CRAC and for other piuposes that will be 
apparent to those of ordinary skill in the art. 

A SOC/CRAC polypeptide, or a fragment thereof, also can be used to isolate naturally 
occurring, polypeptide binding partners which may associate with the SOC/CRAC 
polypeptide in the membrane of a cell. Isolation of binding partners may be performed 
according to well-known methods. For example, isolated SOC/CRAC polypeptides can be 
attached to a substrate, and then a solution suspected of containing an SOC/CRAC binding 
partner may be applied to the substrate. If the binding partner for SOC/CRAC polypeptides is 
present in the solution, then it will bind to the substrate-bound SOC/CRAC polypeptide. The 
binding partner then may be isolated. Other proteins which are binding partners for 
SOC/CRAC, may be isolated by similar methods without undue experimentation. 

The invention also provides novel kits which could be used to measure the levels of 
the nucleic acids of the invention, expression products of the invention or anti-SOC/CRAC 
antibodies. In the case of nucleic acid detection, pairs of primers for amplifying SOC/CRAC 
nucleic acids can be included. The preferred kits would include controls such as known 
amounts of nucleic acid probes, SOC/CRAC epitopes (such as SOC/CRAC expression 
products) or anti-SOC/CRAC antibodies, as well as instructions or other printed material. In 
certain embodiments the printed material can characterize risk of developing a disorder that is 
characterized by aberrant SOC/CRAC polypeptide expression based upon the outcome of the 
assay. The reagents may be packaged in containers and/or coated on wells in predetermined 
amoimts, and the kits may include standard materials such as labeled immunological reagents 
(such as labeled anti-IgO antibodies) and the like. One kit is a packaged polystyrene 
microtiter plate coated with a SOC/CRAC polypeptide and a container containing labeled 
anti-human IgG antibodies. A well of the plate is contacted vsdth, for example, serum, washed 
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and then contacted with the anti-IgG antibody. The label is then detected. A kit embodying 
features of the present invention is comprised of the following major elements: packaging an 
agent of the invention, a control agent, and instructions. Packaging is a box-like structure for 
holding a vial (or number of vials) containing an agent of the invention, a vial (or number of 
vials) containing a control agent, and instructions. Individuals skilled in the art can readily 
modify packaging to suit individual needs. 

Another aspect of the invention is a method for determining the level of SOC/CRAC 
expression in a subject. As used herein, a subject is a human, non-hmnan primate, cow, 
horse, pig, sheep, goat, dog, cat or rodent. In all embodiments, human subjects are preferred. 
Expression is defined either as SOC/CRAC mRNA expression or SOC/CRAC polypeptide 
expression. Various methods can be used to measure expression. Preferred embodiments of 
the invention include PCR and Northern blotting for measuring mRNA expression, and 
monoclonal or polyclonal SOC/CRAC antisera as res^ents to measure SOC/CRAC 
polypeptide expression. In certain embodiments, test samples such as biopsy samples, and 
biological fluids such as blood, are used as test samples. SOC/CRAC expression in a test 
sample of a subject is compared to SOC/CRAC expression in control sample to, e.g., assess 
the presence or absence or stage of a proliferative disorder (e.g., a lymphocyte proliferative 
disorder) in a subject. 

SOC/CRAC polypeptides preferably are produced recombinantly, although such 
polypeptides may be isolated fh)m biological extracts. Recombinantly produced SOC/CRAC 
polypeptides include chimeric proteins comprising a fusion of a SOC/CRAC protein with 
another polypeptide, e.g., a polypeptide capable of providing or enhancing protein-protein 
binding, sequence specific nucleic acid binding (such as GAL4), enhancing stability of the 
SOC/CRAC polypeptide under assay conditions, or providing a detectable moiety, such as 
green fluorescent protein. A polypeptide fused to a SOC/CRAC polypeptide or fragment may 
also provide means of readily detecting the fusion protein, e.g., by immimological recognition 
or by fluorescent labeling. 

The invention is also useful in the generation of transgenic non-human animals. As 
used herein, ^'transgenic non-human animals" includes non-human animals having one or 
more exogenous nucleic acid molecules incorporated in germ line cells and/or somatic cells. 
Thus the transgenic animal include "knockout" animals having a homozygous or 
heterozygous gene disruption by homologous recombination, animals having episomal or 
chromosomally incorporated expression vectors, etc. Knockout animals can be prepared by 
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homologous recombination using embryonic stem cells as is well known in the art. The 
recombination may be facilitated using, for example, the cre/lox system or other recombinase 
systems known to one of ordinary skill in the art. In certain embodiments, the recombinase 
system itself is expressed conditionally, for example, in certain tissues or cell types, at certain 
embryonic or post-embryonic developmental stages, inducibly by the addition of a compound 
which increases or decreases expression, and the like. In general, the conditional expression 
vectors used in such systems use a variety of promoters which confer the desired gene 
expression pattern (e.g., temporal or spatial). Conditional promoters also can be operably 
linked to SOC/CRAC nucleic acid molecules to increase expression of SOC/CRAC in a 
regulated or conditional manner. Trans-acting negative regulators of SOC/CRAC calcium 
channel activity or expression also can be operably linked to a conditional promoter as 
described above. Such trans-acting regulators include antisense SOC/CRAC nucleic acids 
molecules, nucleic acid molecules which encode dominant negative SOC/CRAC molecules, 
ribozyme molecules specific for SOC/CRAC nucleic acids, and the like. The transgenic non- 
hiunan animals are useful in experiments directed toward testing biochemical or physiological 
effects of diagnostics or therapeutics for conditions characterized by increased or decreeised 
SOC/CRAC expression. Other uses will be apparent to one of ordinary skill in the art. 

The invention further provides efficient methods of identifying agents or lead 
compounds for agents active at the level of a SOC/CRAC polypeptide (e.g., a SOC/CRAC 
polypeptide) or SOC/CRAC fragment dependent cellular function. In particular, such 
functions include interaction with other polypeptides or fragments thereof, and selective 
binding to certain molecules (e.g., agonists and antagonists). Generally, the screening 
methods involve assaying for compounds which interfere with SOC/CRAC calcium channel 
activity, although compounds which enhance SOC/CRAC calcium charmel activity also can 
be assayed using the screening methods. Such methods are adaptable to automated, high 
throughput screening of compounds. The target therapeutic indications for pharmacological 
agents detected by the screening methods are limited only in that the target cellular function 
be subject to modulation by alteration of the formation of a complex comprising a 
SOC/CRAC polypeptide or fragment thereof and one or more SOC/CRAC binding targets. 
Target indications include cellular processes modulated by SOC/CRAC such as Ca^"*" fluxing, 
and affected by SOC/CRAC ability to form complexes vsdth other molecules and polypeptides 
as, for example, may be present in the cell membrane. 
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A wide variety of assays for pharmacological agents are provided^ including, 
expression assays, labeled in vitro protein-protein binding assays, electrophoretic mobility 
shift assays, immunoassays, cell-based assays such as calcium transport assays, etc. For 
example, two-hybrid screens are used to rapidly examine the effect of transfected nucleic 
acids on the intracellular binding of SOC/CRAC or SOC/CRAC fragments to specific 
intracellular targets (e.g. a tyrosine kinase). The transfected nucleic acids can encode, for 
example, combinatorial peptide libraries or cDNA libraries. Convenient reagents for such 
assays, e.g., GAL4 fusion proteins, are known in the art. An exemplary cell-based assay 
involves transfecting a cell with a nucleic acid encoding a SOC/CRAC polypeptide fused to a 
GAL4 DNA binding domain and a nucleic acid encoding a reporter gene operably linked to a 
gene expression regulatory region, such as one or more GAL4 binding sites. Activation of 
reporter gene transcription occurs when the SOC/CRAC and reporter fusion polypeptides bind 
such as to enable transcription of the reporter gene. Agents which modulate a SOC/CRAC 
polypeptide mediated cell function are then detected through a change in the expression of 
reporter gene. Methods for determining changes in the expression of a reporter gene are 
known in the art. 

In an expression system, for example, a SOC/CRAC polypeptide is attached to a 
membrane, the membrane preferably separating two fluid environments and being otherwise 
not permeable to Ca^^. Such separation is preferred so that a change in Ca^"*" concentration on 
either side of the membrane is mediated only through the attached SOC/CRAC polypeptide. 
Preferably, a SOC/CRAC polypeptide is expressed in an intact cell and is present on the cell- 
membrane (as in physiologic conditions). The cell expressing the SOC/CRAC polypeptide is 
preferably a eukaryotic cell, and the SOC/CRAC polypeptide is preferably recombinantly 
expressed, although cells naturally expressing a SOC/CRAC polypeptide may also be used. 
Synthetic membranes, however, containing SOC/CRAC polypeptides may also be used. See, 
e.g., K. Kiselyov, et al,. Functional interaction between InsP3 receptors and store-operated 
Htrp3 channels. Nature 396. 478-82 (1 998). 

The cell expressing the SOC/CRAC polypeptide is incubated imder conditions which, 
in the absence of the candidate agent, permit calcium flux into the cell and allow detection of 
a reference calcium concentration. For example, depletion of intracellular calcium stores with 
thapsigargin or other agents (Putney, J.W. Jr., in Capacitative Calcium Entry. R.G. Landes 
Co. and Chapman & Hall, 1997) would produce a given level of SOC/CRAC channel 
activation and a given reference calcium concentration. Detection of a decrease in the 
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foregoing activities (i.e., a decrease in the intracellular calcium concentration) relative to the 
reference calcium concentration indicates that the candidate agent is a lead compound for an 
agent to inhibit SOC/CRAC calcium channel activity. Preferred SOC/CRAC polypeptides 
include the polypeptides of claim 15. 

SOC/CRAC fragments used in the methods, when not produced by a transfected 
nucleic acid are added to an assay mixture as an isolated polypeptide. SOC/CRAC 
polypeptides preferably are produced recombinantly, although such polypeptides may be 
isolated from biological extracts or chemically synthesized. Recombinantly produced 
SOC/CRAC polypeptides include chimeric proteins comprising a fusion of a SOC/CRAC 
protein with another polypeptide, e.g., a polypeptide capable of providing or enhancing 
protein-protein binding, sequence specific nucleic acid binding (such as GAL4), enhancing 
stability of the SOC/CRAC polypeptide under assay conditions, or providing a detectable 
moiety, such as green fluorescent protein or Flag epitope. 

The assay mixture is comprised of a SOC/CRAC polypeptide binding target 
(candidate agent) capable of interacting with a SOC/CRAC polypeptide. While natural 
SOC/CRAC binding targets may be used, it is frequently preferred to use portions (e.g., 
peptides or nucleic acid fragments) or analogs (i.e., agents which mimic the SOC/CRAC 
binding properties of the natural binding target for purposes of the assay) of the SOC/CRAC 
binding target so long as the portion or analog provides binding afSfinity and avidity to the 
SOC/CRAC polypeptide (or fragment thereof) measurable in the assay. 

The assay mixture also comprises a candidate agent (binding target, e.g., 
agonist/antagonist). Typically, a plurality of assay mixtures are run in parallel with different 
agent concentrations to obtain a different response to the various concentrations. Typically, 
one of these concentrations serves as a negative control, i.e., at zero concentration of agent or 
at a concentration of agent below the limits of assay detection. Candidate agents encompass 
numerous chemical classes, although typically they are orgemic compounds. Preferably, the 
candidate agents are small organic compounds, i.e., those having a molecular weight of more 
than 50 yet less than about 2500, preferably less than about 1000 and, more preferably, less 
than about 500. Candidate agents comprise functional chemical groups necessary for 
structural interactions with polypeptides and/or nucleic acids, and typically include at least an 
amine, carbonyl, hydroxyl or carboxyl group, preferably at least two of the functional 
chemical groups and more preferably at least three of the functional chemicsd groups. The 
candidate agents can comprise cyclic carbon or heterocyclic structure and/or aromatic or 
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polyaromatic structures substituted with one or more of the above-identiiied functional 
groups. Candidate ^ents also can be biomolecules such as peptides, saccharides, fatty acids, 
sterols, isoprenoids, purines, pyrimidines, derivatives or structural analogs of the above, or 
combinations thereof and the like. Where the agent is a nucleic acid, the agent typically is a 
DNA or RNA molecule, although modified nucleic acids as defined herein are also 
contemplated. 

Candidate agents are obtained from a wide variety of sources including libraries of 
synthetic or natural compounds. For example, numerous means are avsiilable for random and 
directed synthesis of a wide variety of organic compounds and biomolecules, including 
expression of randomized oligonucleotides, S5^thetic organic combinatorial libraries, phage 
display libraries of random peptides, and the like. Alternatively, libraries of natural 
compoimds in the form of bacterial, fungal, plant and animal extracts are available or readily 
produced. Additionally, natural and synthetically produced libraries and compoimds can be 
readily modified through conventional chemical, physical, and biochemical means. Further, 
known agents may be subjected to directed or random chemical modifications such as 
acylation, alkylation, esterification, amidification, etc. to produce structural analogs of the 
agents, Non-SOC/CRAC calcium channel agonists and antagonists, for example, include 
agents such as dihydropyridines (DHPs), phenylalkylamines, omega conotoxin 
(omega.-CgTx) and pyrazonoylguanidines. 

A variety of other reagents also can be included in the mixture. These include 
reagents such as salts, buffers, neutral proteins (e.g., albumin), detergents, etc. which may be 
used to facilitate optimal protein-protein, protein-nucleic acid, and/or protein/membrane 
component binding association. Such a reagent may also reduce non-specific or background 
interactions of the reaction components. Other reagents that improve the efficiency of the 
assay such as protease, inhibitors, nuclease inhibitors, antimicrobial agents, and the like may 
also be used. 

The mixture of the foregoing assay materials is incubated under conditions whereby, 
but for the presence of the candidate agent, the SOC/CRAC polypeptide specifically binds the 
cellular binding target, a portion thereof or analog thereof. The order of addition of 
components, incubation temperature, time of incubation, and other perimeters of the assay 
may be readily determined. Such experimentation merely involves optimization of the assay 
parameters, not the fundamental composition of the assay. Incubation temperatures typically 
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are between 4°C and 40**C. Incubation times preferably are minimized to facilitate rapid, high 
throughput screening, and typically are between 0.1 and 10 hours. 

After incubation, the presence or absence of specific binding between the SOC/CRAC 
polypeptide and one or more binding targets is detected by any convenient method available 
to the user. For cell free binding type assays, a separation step is often used to separate bound 
from unbound components. The separation step may be accomplished in a variety of ways. 
Conveniently, at least one of the components is immobilized on a solid substrate, from which 
the unbound components may be easily separated. The solid substrate can be made of a wide 
variety of materials and in a vsdde variety of shapes, e.g., microliter plate, microbead, dipstick, 
resin particle, etc. The substrate preferably is chosen to maximimi signal to noise ratios, 
primarily to minimize background binding, as well as for ease of separation and cost. 

Separation may be effected for example, by removing a bead or dipstick from a 
reservoir, emptying or diluting a reservoir such as a microtiter plate well, rinsing a bead, 
particle, chromotograpic column or filter with a wash solution or solvent. The separation step 
preferably includes multiple rinses or washes. For example, when the solid substrate is a 
microtiter plate, the wells may be washed several times with a washing solution, which 
tj^ically includes those components of the incubation mixture that do not participate in 
specific bindings such as salts, buffer, detergent, non-specific protein, etc. Where the solid 
substrate is a magnetic bead, the beads may be washed one or more times with a washing 
solution and isolated using a magnet. 

Detection may be effected in any convenient way for cell-based assays such as two- or 
three-hybrid screens. The transcript resulting from a reporter gene transcription assay of 
SOC/CRAC polypeptide interacting with a target molecule typically encodes a directly or 
indirectly detectable product, e.g., P-galactosidase activity, luciferase activity, and the like. 
For cell-free binding assays, one of the components usually comprises, or is coupled to, a 
detectable label. A wide variety of labels can be used, such as those that provide direct 
detection (e.g., radioactivity, luminescence, optical or electron density, etc.) or indirect 
detection (e.g., epitope tag such as the FLAG epitope, enzyme tag such as horseseradish 
peroxidase, etc.). The label may be bound to a SOC/CRAC binding partner, or incorporated 
into the structure of the binding partner. 

A variety of methods may be used to detect the label, depending on the nature of the 
label and other assay components. For example, the label may be detected while bound to the 
solid substrate or subsequent to separation fix>m the solid substrate. Labels may be directly 
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detected through optical or electron density, radioactive emissions, nonradiative energy 
transfers, etc. or indirectly detected with antibody conjugates, strepavidin-biotin conjugates, 
etc. Methods for detecting the labels are well known in the art. 

Of particular importance in any of the foregoing assays and binding studies is the use 
of a specific sequence motif identified in the S0C-2/CRAC-1 polypeptide sequence as a 
kinase catalytic domedn. According to the invention, amino acids 999-1180 of the SOC- 
2/CRAC-l polypeptide (SEQ ID NO:24) (or a fragment thereof), show a localized homology 
with the catalytic domains of eukaryotic elongation factor-2 kinase (eEF-2 kinase, GenBank 
Acc. no. U93850) and Dictyostelium myocin heavy chain kinase A (MHCK A, GenBank Acc. 
no. U 16856), as disclosed in Ryazanov AG, et al., Proc Natl Acad Sci USA, 1997, 
94(10):4884-4889. Therefore, according to the invention, a method for identifying agents 
useful in the modulation of SOC/CRAC polj^jeptide kinase activity is provided. The method 
involves contacting a SOC/CRAC poi3rpeptide with kinase activity, that includes, for 
example, amino acids 999-1180 of the SOC-2/CRAC-1 polypeptide (SEQ ID NO:24) with a 
candidate agent suspected of modulating SOC/CRAC kinase activity, imder conditions 
sufficient to allow the candidate agent to interact with the SOC/CRAC polypeptide and 
modulate its kinase activity; detecting a kinase activity associated with the SOC/CRAC 
polypeptide in the presence of the candidate agent; and comparing the kinase activity in the 
previous step with a control kinase activity of a SOC/CRAC polypeptide in the absence of the 
candidate agent to determine whether the candidate agent modulates (increases or decreases) 
SOC/CRAC kinase activity. Other controls for kinase activity can also be performed at the 
same time, for example, by utilizing eEF-2 kin£ise and/or Dictyostelium MHC Kinase A, in a 
similar manner to the SOC/CRAC member. Methods for performing such kinase activity 
assays are well known in the art. 

The invention thus provides SOC/CRAC-specific binding agents, methods of 
identifying and making such agents, and their use in diagnosis, therapy and pharmaceutical 
development. For example, SOC/CRAC-specific agents are useful in a variety of diagnostic 
and therapeutic applications, especially where disease or disease prognosis is associated with 
altered SOC/CRAC and SOC/CRAC calcium channel fluxing characteristics. Novel 
SOC/CRAC-specific binding agents include SOC/CRAC-specific antibodies and other natural 
intracellular and extracellular binding agents identified with assays such as two hybrid 
screens, and non-natural intracellular and extracellular binding agents identified in screens of 
chemical libraries and the like. 
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In general, the specificity of SOC/CRAC binding to a specific molecule is determined 
by binding equilibrium constants. Targets which are capable of selectively binding a 
SOC/CRAC polypeptide preferably have binding equilibriimi constants of at least about 10^ 
M"\ more preferably at least about 10* M"', and most preferably at least about 10^ M"'. The 
wide variety of cell based and cell free assays may be used to demonstrate SOC/CRAC- 
specific binding. Cell based assays include one, two and three hybrid screens, assays in 
which SOC/CRAC-mediated transcription is inhibited or increased, etc. Cell j&ee assays 
include SOC/CRAC-protein binding assays, immunoassays, etc. Other assays useful for 
screening agents which bind SOC/CRAC polypeptides include fluorescence resonance energy 
transfer (FRET), and electrophoretic mobility shift analysis (EMSA). 

Various techniques may be employed for introducing nucleic acids of the invention 
into cells, depending on whether the nucleic acids are introduced in vitro or in vivo in a host. 
Such techniques include transfection of nucleic acid-CaP04 precipitates, trsmsfection of 
nucleic acids associated with DEAE, transfection with a retrovirus including the nucleic acid 
of interest, liposome mediated transfection, and the like. For certain uses, it is preferred to 
target the nucleic acid to particular cells. In such instances, a vehicle used for delivering a 
nucleic acid of the invention into a cell (e.g., a retrovirus, or other virus; a liposome) can have 
a targeting molecule attached thereto. For example, a molecule such as an antibody specific 
for a surface membrane protein on the target cell or a ligand for a receptor on the target cell 
can be bound to or incorporated within the nucleic acid delivery vehicle. For example, where 
liposomes are employed to deliver the nucleic acids of the invention, proteins which bind to a 
surface membrane protein associated with endocytosis may be incorporated into the liposome 
formulation for targeting and/or to facilitate uptake. Such proteins include capsid proteins or 
fragments thereof tropic for a particular cell type, antibodies for proteins which undergo 
intemahzation in cycling, proteins that target intracellular localization and enhance 
intracellular half life, and the like. Polymeric delivery systems also have been used 
successfully to deliver nucleic acids into cells, as is known by those skilled in the art. Such 
systems even permit oral delivery of nucleic acids. 

Other delivery systems can include time-release, delayed release or sustained release 
delivery systems. Such systems can avoid repeated administrations of the anti-inflammatory 
agent, increasing convenience to the subject and the physician. Many types of release 
delivery systems are available and known to those of ordinary skill in the art. They include 
polymer base systems such as poly(lactide-glycolide), copolyoxalates, polycaprolactones. 
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poiyesteramides, polyorthoesters, . polyhydroxybutyric acid, and poly anhydrides. 
Microcapsules of the foregoing polymers containing drugs are described in, for example, U.S. 
Patent 5,075,109. Delivery systems also include non-polymer systems that are: lipids 
including sterols such as cholesterol, cholesterol esters and fatty acids or neutral fats such as 
mono- di- and tri-glycerides; hydrogel release systems; sylastic systems; peptide based 
systems; wax coatings; compressed tablets using conventional binders and excipients; 
partially fused implants; and the like. Specific examples include, but are not limited to: (a) 
erosional systems in which an agent of the invention is contained in a form within a matrix 
such as those described in U.S. Patent Nos. 4,452,775, 4,675,189, and 5,736,152, and (b) 
diffusional systems in which an active component permeates at a controlled rate from a 
polymer such as described in U.S. Patent Nos. 3,854,480, 5,133,974 and 5,407,686. In 
addition, pump-based hardware delivery systems can be used, some of which are adapted for 
implantation. 

Use of a long-term sustained release implant may be particularly suitable for treatment 
of chronic conditions. Long-term release, are used herein, means that the implant is 
constructed and arranged to delivery therapeutic levels of the active ingredient for at least 30 
days, and preferably 60 days. Long-term sustained release implants are well-known to those 
of ordinary skill in the art and include some of the release systems described above. 

The invention also contemplates gene therapy. The procedure for performing ex vivo 
gene therapy is outlined in U.S. Patent 5,399,346 and in exhibits submitted in the file history 
of that patent, all of which are publicly available documents. In general, it involves 
introduction in vitro of a functional copy of a gene into a cell(s) of a subject which contains a 
defective copy of the gene, and retimiing the genetically engineered cell(s) to the subject. The 
functional copy of the gene is under operable control of regulatory elements which permit 
expression of the gene in the genetically engineered cell(s). Numerous transfection and 
transduction techniques as well as appropriate expression vectors eire well known to those of 
ordinary skill in the art, some of which are described in PCT application WO95/00654. In 
vivo gene therapy using vectors such as adenovirus, retroviruses, herpes virus, and targeted 
liposomes also is contemplated according to the invention. See, e.g., U.S. Patent Nos. 
5,670,488, entitled "Adenovirus Vector for Gene Therapy", issued to Gregory et al., and 
5,672,344, entitled "Viral-Mediated Gene Transfer System", issued to Kelley et al. 
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The invention will be more fully understood by reference to the following examples. 
These examples, however, are merely intended to illustrate the embodiments of the invention 
and are not to be construed to limit the scope of the invention. 

Examples 

As an initial approach to identifying SOC/CRAC channels, we considered publicly 
available data and hypothesized that the following characteristics are likely to be exhibited by 
SOC/CRAC calciimi channels: i) SOC/CRAC calcium channels would be integral membrane 
proteins related (probably distantly) to one of the known calcium channel families (e.g. 
voltage gated, ligand gated, Trp), and therefore should have a pore region formed by a 
tetramer of 6-7 transmembrane (TM) regions; ii) high calcium selectivity was likely to come 
at the price of complexity, and therefore these were likely to be large proteins; iii) the high 
calcium selectivity of this type of chaimel was likely to be useful and, therefore, highly 
conserved; and iv) these channels should be expressed in one or more types of lymphocytes, 
since ICRAC is best defined in those cell types. Since the full genome of the nematode C. 
elegans is nearing completion, and IP3-dependent calcium signals have recently been shown 
to be required for one or more aspects of C. elegans development, we took the set of proteins 
encoded by this genome (at the time this search was initiated WORMPEP14 was the available 
predicted protein set) and began searching for proteins which fit the criteria above. This 
search began by proceeding in alphabetical order through WORMPEP14 and arbitrarily 
excluding all proteins below ^proximately 1000 amino acids in size, followed by focusing on 
remaining proteins with clear TM spaiming regions similar to those of other calcium channels. 
We stopped this screen on encoxmtering a protein designated C05C12.3, a predicted protein 
of 1816 amino acids (SEQ ID NO: 13). C05C12.3 was noteable because its central pore 
region had some sequence similarity to but was clearly distinct from members of the Trp 
family of calcium channels, and the hydrophobicity plot of this region showed a 
characteristically wide spacing between the fifth and sixth TM regions for the amino acid 
residues which are thought to line the channel pore region and mediate the calcium selectivity 
of the channels. In addition, it lacked any ankyrin repeats in the region amino-terminal to its 
pore region, further distinguishing it from other Tip family proteins. 

We then used C05C12.3 for BLAST aligrmient screening of the rest of the C. elegans 
genome and also mammalian databases for homologous proteins, revealing two other C. 
elegans homologues (SEQ ID NO: 14 and SEQ ID NO: 15), and also a recently cloned 
mammalian protein named melastatin-1 (MLSN-l/SOC-1, SEQ ID NOs:9 and 10, and 
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GenBank Acc. No. AF071787). Using these sequences, we subsequently performed an 

exhaustive screening of publicly accessible EST databases in search of lymphocyte 

homologues, but were unsuccessful in detecting any homologous transcripts in any 

lymphocyte lines. Since MLSN-1 (SEQ ID NOs:9 and 10) was expressed exclusively in 

melanoc3^es and retina by Northern blot hybridization and by EST database searching, there 

was no evidence that this type of channel was expressed in the type of cell in which ICRAC- 

like currents were best defined. Subsequent BLAST searches picked up mouse EST 

sequence AI098310 (SEQ ID NO:22) from a monocyte cell line. The I.M.A.G.E. consortium 

clone containing the above-identified EST was then purchased fi-om ATCC (clone ID. 

1312756, Manassas, VA) and was further characterized. Using other portions of this 

sequence in EST searches, we subsequently picked up similar sequences in human B-cells 

(SEQ ID NOs:20 and 21), and other cell types as well (SEQ ID NOs: 1 1, 12, 16, 17, 18, and 

19). Most of these sequences were subsequently identified to be part of the 3'-UTR or of the 

carboxy terminal region of the proteins, which are not readily identifiable as Trp channels, 

providing an explanation for the art's inability to detect any type of Trp related transcripts in 

lymphocj^es. Partial sequences fix)m the 5' and/or 3' ends of the above identified clones were 

then used to screen leukoc3^e and kidney cDNA libraries to extend the original sequences 

more toward the 5' and/or 3' ends. 

In view of the foregoing, it was concluded that channels of this type were expressed in 

many types of lymphocytes, and therefore were members of a new family of SOC/CRAC 

calcium channels. 

Experimental Procedures 
Screening of the cDNA libraries 

Leukocyte and kidney cDNA libraries fi-om Life Technologies (Gaithersburg, MD) 

were screened using the Gene Trapper II methodology (Life Technologies) according to 

manufacturer's reconunendation, using the inserts of I.M.A.G.E. clone ID nos. 1312756 and 

1076485 from ATCC (Manassas, VA), under stringent hybridization conditions. Using 

standard methodology ( Molecular Cloning: A Laboratory Manual, J. Sambrook, et al., eds., 

Second Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1989, 

or Current Protocols in Molecular Biology^ P.M. Ausubel, et al., eds., John Wiley & Sons, 

Inc., New York), individual cDNA clones were subjected to 3-4 rounds of amplification and 

purification under the same hybridization conditions. 

After excision from the vector and subcloning of inserts into the plasmid forms, 

several clones were sequenced by the Beth Ismel Deaconess Medical Center's Automated 
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Sequencing Facility. Molecular biological techniques such as restriction enzyme treatment, 
subcloning, DNA extraction, bacterial culture and purification of DNA firagments were 
performed according to methods well known in the art. Computer analyses of protein and 
DNA sequences was done using "Assemblylign" (Oxford Molecular, Cambell, CA). Multiple 

5 alignments of the SOC/CRAC family members were produced using the CLUSTAL facility 
of the Mac Vector program. Restriction endonucleeises, expression vectors, and modifying 
enzymes were purchased from commercial sources (Gibco-BRL). Sequencing vectors for 
DNA were purchased from Stratagene (La Jolla, CA). 

Once the first members of what appeared to be a novel family of calcium channel 

10 receptors were identified and characterized, additional BLAST alignments were performed 
with the newly characterized nucleic acid sequences. An initial match was with genomic 
DNA fragment NH0332L1 1 (Genbank Acc. No. AC005538). Using this genomic sequence, 
promers were designed and a number of cDNA libraries was surveyed by PCR. A prostate 
specific message was identified and characterized, leading to the isolation and 

IS characterization of SOC-4/CRAC-3 (SEQ ID NOs: 3 1 and 32). 
Functional Assays 

Transient Expression of SOC/CRAC 

In our initial transient expression experiments, we expressed or expect to express a 
SOC/CRAC molecule transiently in RBL-2H3 mast cells, Jurkat T cells, and A20 

20 B-lymphocytes using both electroporation and vaccinia virus-driven expression, and 
measured the calcium influx produced by depletion of intracellular calcium stores with 
thapsigargin. Each of the foregoing techniques is well known to those of ordinary skill in the 
art and can be performed using various methods (see, e.g., Cxirrent Methods in Molecular 
Biology, eds. Ausubal, P.M., et al. 1987, Green Publishers and Wiley Interscience, N.Y., 

25 N.Y.). Exemplary methods are described herein. 

Depletion of intracellular calcium stores is accomplished by treating the cells with 1 
micromolar thapsigargin; alternative agents which ftinction to deplete intracellular stores are 
described in by Putney, J.W. Jr., in Capacitative Calcixmi Entry, R.G. Landes Co. and 
Chapman & Hall, 1997 and include, for example, ionomycin, cyclopiazonic acid, and DBHQ. 

30 Calcium influx is determined by measuring cytoplasmic calcium as indicated using the 

fura-2 fluorescent calcium indicator (see, e.g., G. Grynkiewicz, M. Poenie, R. Y. Tsien, A 
new generation of Ca2+ indicators with greatly improved fluorescence properties, J. Biol 
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Chem 260, 3440-50 (1985). and M, Poenie, R. Tsien, Fura-2: a powerful new tool for 
measuring and imaging [Ca2+]i in single cells. Prog Clin Biol Res 210 . 53-6 (1986)). 

Patch Clamp Analysis and Determinins^ Selectivity ofSOC/CRAC 

Patch clamp analysis of cells injected with SOC/CRAC cRNA is performed by using 
5 the general patch technique as described in Neher, E., "Ion channels for communication 
between and within cells". Science, 1992 ; 256:498-502, Specific techniques for applying the 
patch clamp analysis to RBL cells are described in Hoth, M., and Penner, R., "Depletion of 
intracellular calcium stores activates a calcium current in mast cells". Nature. 1992 : 
355:3535-355, Additional protocols for applying the patch clamp technique to other cell 
10 types are described in Putney, J.W. Jr., in Capacitative Calcium Entry, R.G. Landes Co. and 
Chapman & Hall, 1997 

An exemplary protocol for patch clamp analysis of SOC/CRAC molecule expressed in 
RBL-2H3 mast cells iising a recombinant vaccinia virus is as follows. The currents elicited 
by store depletion are determined using the whole cell configuration (Neher, E., Science, 
IS 1992 : 256:498-502). Currents in SOC/CRAC expressing cells are conipared to currents in 
control cells expressing an irrelevant protein or a classic Trp family calcium channel known 
as VRl (M. J. Caterina, et al.. The capsaicin receptor: a heat-activated ion channel in the pain 
pathway [see comments]. Nature 389. 816-24 (1997)) in order to assess the contribution of 
SOC/CRAC expression. In addition, the magnitude of whole cell currents in the presence of 
20 extracellular calcium (10 mM), barium (10 mM), or magnesium (10 mM) are compared to 
determine the relative permeability of the channels to each of these ions (Hoth, M., and 
Penner, R., Nature, 1992 : 355:3535-355) and, thereby, determine the ionic selectivity. 

Pharmacologic Behavior of SOC/CRAC 

For analysis of the pharmacologic behavior of a SOC/CRAC molecule, a SOC/CRAC 
25 molecule is expressed in RBL-2H3 mast cells using a recombinant vaccinia virus, and the 
degree of calcium influx elicited by store depletion is monitored using a bulk 
spectrofluorimeter or a fluorescence microscope and the calcium sensitive dye fura-2 (G. 
Grrynkiewicz, M. Poenie, R. Y. Tsien, A new generation of Ca2"*' indicators with greatly 
unproved fluorescence properties, J Biol Chem 260, 3440-50 (1985) and M. Poenie, R. 
30 Tsien, Fura-2: a powerful new tool for measuring and imaging [Ca2+li in single cells. Prog 
Clin Biol Res 210 , 53-6 (1986)). The level of cytoplasmic calcium in SOC/CRAC 
expressing cells is compared to the level achieved in control cells expressing an irrelevant 
protein or a classic Trp. family calcium channels known as VRl (M. J. Caterina, et al.. The 
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capsaicin receptor: a heat-activated ion channel in the pain pathway [see comments]. Nature 
389, 816-24 (1997)). These cells then are pre-incubated with the desired pharmacologic 
reagent, and again the response to store depletion is monitored. Comparison of the effect of 
depleting stores in SOC/CRAC expressing cells relative to controls in the presence or absence 
of the pharmacologic reagent is used to assess the ability of that reagent to modulate 
SOC/CRAC activity. Sphingosine is an exemplary molecule that can be used as 
pharmacologic reagents for pharmacologic characterization of SOC/CRAC calcium channels. 
See, e.g., Mathes, C, et al., Calcium release activated calcivim current as a direct target for 
sphingosine, J Biol Chem 273r39) :2502Q-25030 (1998). Other non-specific calcium chaimel 
inhibitors that can be used for this purpose include SKR96365 (Calbiochem) and Lanthanum. 
Bulk Calcium Assays 

Bulk calcium assays can be performed in a PTI Deltascan bulk spectrofluorometer using fura- 
2 as described in Scharenberg AM, et al., EMBO J, 1 995, 14(14):3385-94. 
Gene Targeting 

The method (and reagents) described by Buerstedde JM et al, [Celh 1991, Oct 4;67(1):179- 
88), was used to generate "knockouts" in cells. Briefly, part of the chicken SOC-2/CRAC-1 
genomic sequence coding for the transmembrane region was cloned utilizing the human 
sequence as the probe in a chicken library screen. Chicken SOC-2/CRAC-1 clones were 
isolated and characterized using standard methodology. The putative exon and domain 
arrangement of the chicken SOC-2/CRAC-1, is depicted in Figure 1. The exons coding for 
TM5 (pore region) and TM6, were replaced with promoter/antibiotic cassettes (see Figure 1). 
These targeting vectors were then used to target (and replace) the endogenous gene in DT-40 
cells (chicken B lymphocyte cells). 

Results 

Example 1 : Transient Expression of SOC/CRAC 

In the above-identified cell lines and using both of the foregoing expression 
techniques, SOC/CRAC egression enhances thapsigargin-dependent influx. In addition, 
SOC/CRAC expression also enhances the amoimt of intracellular calcium stores. That this 
effect is likely due to SOC/CRAC acting as a plasma membrane calcium channel can be 
confirmed by producing an in-firame carboxy-terminal translational fusion with green 
fluorescent protein followed by confocal microscopy, revealing that SOC/CRAC is expressed 
predominantly as a plasma membrane calcium channel. 
Example 2 : Patch Clamp Analysis 
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The biophysical characteristics of SOC7CRAC enhanced currents when expressed in 
Xenopus oocytes are determined. SOC/CRAC cRNA injection is able to enhance 
thapsigargin-dependent whole cell currents. In addition, SOC/CRAC does not alter the 
reversal potential of these currents and the determination of the Pca/PNa ratio shows that 
SOC/CRAC charmels are highly calcium selective. 
Example 3 : Pharmacologic Behavior of SOC/CRAC 

The pharmacologic behavior of SOC/CRAC is evaluated as described above. 
SOC/CRAC-enhanced influx is inhibited by sphingosine in a manner that is substantially the 
same as that of endogenous thapsigargin-dependent calcium influx. 
Example 4 : Gene tar^etins 

Transfection of DT-40 cells with the foregoing targeting vectors, selection for 
antibiotic resistance, and screening, is coUectivelly refered to, herein, as a round of targeting. 
For the first round of targeting SOC-2/CRAC-1, 18/24 clones with homologous 
recombination of the targeting construct into one of the endogenous SOC-2/CRAC-1 alleles 
were obtained. On the second round of targeting (in order to target the second allele and 
therefore generate a homozygous SOC-2/CRAC-1 mutant cell), 0/48 clones were obtained. 
These results indicate that a "null" SOC-2/CRAC-1 mutation is detrimental to DT-40 cells, 
and that SOC-2/CRAC-1 is required for cell viability. 

Table I. Nucleotide Sequences with homologies to SOC/CRAC nucleic acids 
Sequences with SEQ ID NOs and GenBank accession numbers: 
SEQ ID N0:9, AB001535, AI226731, H18835, AA419592, AA261842, AA419407, 
AA592910, D86107, AI098310, AF071787, Z77132, Z831 17, Z68333, AA708532, 
AA551759, AA932133, R47363, N31660, AC005538, AA654650, AA3701 10, AA313170, 
AA493512, AI670079, AI671853. 

Table II. Amino Acid Sequences with homologies to SOC/CRAC polypeptides 

Sequences with SEQ ID NOs and GenBank accession numbers: 

SEQ ID NO: 10, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID 
NO:19, AB001535, AA592910, D86107, AF071787, Z77132, Z83117, Z68333, 
AA708532, AA551759, AA932133, R47363, N31660, NP003298, CAB00861, NP00241 1, 
CAA92726, CAB05572. 

All references, patents, and patent documents disclosed herein are incorporated by 
reference herein in their entirety. 

What is claimed is presented below and is followed by a Sequence Listing. We claim: 
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Claims 

1 . All isolated nucleic acid molecule, comprising: 

(a) nucleic acid molecules which hybridize under stringent conditions to a nucleic 
acid molecule selected from the group consisting of SEQ ID NO:l, SEQ ID NO:3, SEQ ID 
NO:5, SEQ ID NO:7, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, and 
SEQ ID NO:31, and which code for a SOC/CRAC polypeptide; 

(b) deletions, additions and substitutions of (a) which code for a respective 
SOC/CRAC polypeptide; 

(c) nucleic acid molecules that differ from the nucleic acid molecules of (a) or (b) in 
codon sequence due to the degeneracy of the genetic code, and 

(d) complements of (a), (b) or (c). 

2. The isolated nucleic acid molecule of claim 1, wherein the isolated nucleic acid 
molecule comprises SEQ ID NO:l . 

3. The isolated nucleic acid molecule of claim 1, wherein the isolated nucleic acid 
molecule comprises SEQ ID NO:27. 

4. The isolated nucleic acid molecule of claim 1, wherein the isolated nucleic acid 
molecule comprises SEQ ID NO:29. 

5. The isolated nucleic acid molecule of claim 1, wherein the isolated nucleic acid 
molecule comprises SEQ ID NO:31. 

6. An isolated nucleic acid molecule selected from the group consisting of 

(a) a unique fragment of a nucleic acid molecule selected from the group 
consisting of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:23, 
SEQ ID NO:25, SEQ ID NO:29, and SEQ ID NO:31, 

(b) complements of (a), 

provided that the unique fragment includes a sequence of contiguous nucleotides which is not 
identical to any sequence selected from a sequence group consisting of 

(1) sequences having the SEQ. ID NOS. or GenBank accession numbers of Table I, 

(2) complements of (1 ), and 

(3) fragments of (1) and (2). 
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7. The isolated nucleic acid molecule of claim 6, wherein the sequence of contiguous 
nucleotides is selected from the group consisting of: 

(1) at least two contiguous nucleotides nonidentical to the sequence group, 

(2) at least three contiguous nucleotides nonidentical to the sequence group, 

(3) at least four contiguous nucleotides nonidentical to the sequence group, 

(4) at least five contiguous nucleotides nonidentical to the sequence group, 

(5) at least six contiguous nucleotides nonidentical to the sequence group, 

(6) at least seven contiguous nucleotides nonidentical to the sequence group. 

8. The isolated nucleic acid molecule of claim 6, wherein the unique fragment has a size 
selected from the group consisting of at least: 8 nucleotides, 10 nucleotides, 12 nucleotides, 
14 nucleotides, 16 nucleotides, 18 nucleotides, 20, nucleotides, 22 nucleotides, 24 
nucleotides, 26 nucleotides, 28 nucleotides, 30 nucleotides, 50 nucleotides, 75 nucleotides, 
100 nucleotides, and 200 nucleotides. 

9. The isolated nucleic acid molecule of claim 6, wherein the molecule encodes a 
polypeptide which is immunogenic. 

10. An expression vector comprising the isolated nucleic acid molecule of claims 1, 2, 3, 
4, 5, 6, 7, 8, or 9 operably linked to a promoter. 

11. A host cell transformed or transfected with the expression vector of claim 10. 

12. An isolated polypeptide encoded by the isolated nucleic acid molecule according to 
anyone of claims 1 or 6, wherein the polypeptide comprises a SOC/CRAC polypeptide or a 
unique fragment thereof. 

13. The isolated polypeptide of claim 12, wherein the isolated polypeptide is encoded by 
the isolated nucleic acid molecule of claim 2, 3, 4, or 5. 

14. The isolated polypeptide of claim 13, wherein the isolated polypeptide comprises a 
polypeptide having the sequence of amino acids selected from the group consisting of SEQ ID 
NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:24, SEQ ID NO:26, SEQ ID 
NO:28, SEQ ID NO:30, and SEQ ID NO:32. 
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ls. An isolated polypeptide encoded by the isolated nucleic acid molecule of claim 1, 2, 3, 

4, or S, wherein the polypeptide, or unique fragment thereof is immunogenic. 

16. An isolated binding polypeptide which binds selectively to a polypeptide encoded by 
the isolated nucleic acid molecule of claim 1, 2, 3, 4, or 5. 

17. The isolated binding polypeptide of claim 16, wherein the isolated binding 
polypeptide binds to a polypeptide having the sequence of amino acids selected from the 
group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID 
NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, and SEQ ID NO:32. 

18. The isolated binding polypeptide of claim 17, wherein the isolated binding 
polypeptide is an antibody or an antibody fragment selected from the group consisting of a 
Fab fragment, a F(ab)2 fragment or a fragment including a CDR3 region selective for the 
polypeptide. 

19. An isolated polypeptide, comprising a unique fragment of the polypeptide of claim 12 
of sufficient length to represent a sequence unique within the human genome, provided that 
the fragment excludes a sequence of contiguous amino acids identified in Table II. 

20. A method for isolating a SOC/CRAC molecule having SOC/CRAC calcium channel 
activity, comprising: 

a) contacting a binding molecule that is a SOC/CRAC nucleic acid or a SOC/CRAC 
binding polypeptide with a sample containing one or more SOC/CRAC molecules, under 
conditions sufficient to form a complex of the SOC/CRAC nucleic acid or the SOC/CRAC 
binding polypeptide and the SOC/CRAC molecule; 

b) detecting the presence of the complex; 

c) isolating the SOC/CRAC molecule from the complex; and 

d) determining whether the isolated SOC/CRAC molecule has SOC/CRAC calcium 
chaimel activity. 

21 . The method of claim 20, vdierein the binding molecule is a SOC/CRAC nucleic acid. 

22. The method of claim 20, wherein the binding molecule is a SOC/CRAC binding 
polypeptide. 
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23. The method of claim 21, wherein the SOC/CRAC nucleic acid comprises at least 14 
nucleotides from any contiguous portion of a sequence of nucleotides selected from the group 
consisting of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:23, 
SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, and SEQ ID NO:3I. 

5 24. A method for identifying agents useful in the modulation of SOC/CRAC calcium 
channel activity, comprising: 

a) contacting a SOC/CRAC polypeptide with a candidate agent suspected of 
modulating SOC/CRAC calcium channel activity, under conditions sufficient to allow the 
SOC/CRAC polypeptide to interact selectively with the candidate agent; 
w b) detecting a Ca^*** concentration associated with SOC/CRAC calcium chaimel 

activity of the SOC/CRAC polypeptide in the presence of the candidate agent; and 

c) comparing the Ca^"*" concentration of step (b) with a control Ca^^ concentration of a 
SOC/CRAC polypeptide in the absence of the candidate agent to determine whether the 
candidate agent modulates SOC/CRAC calcium channel activity. 

15 

25. A method for determining the level of SOC/CRAC expression in a subject, 
comprising: 

a) measuring the expression of SOC/CRAC in a test sample obtained from the 
subject, and 

. 20 b) comparing the measured expression of SOC/CRAC in the test sample to the 

expression of the SOC/CRAC poljqDeptide in a control to determine the level of SOC/CRAC 
expression in the subject. 

26. The method of claim 25, wherein the expression of SOC/CRAC in (b) is SOC/CRAC 
25 mRNA expression. 

27. The method of claim 25, wherein the expression of SOC/CRAC in (b) is SOC/CRAC 
polypeptide expression. 

28. The method of claim 25, wherein the test sample is tissue. 



29. 



The method of claim 25, wherein the test sample is a biologiczil fluid. 
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30. The method of claim 26, wherein SOC/CRAC mRNA expression is measured using 
the Polymerase Chain Reaction (PGR). 

31. The method of claim 26, wherein SOC/CRAC mRNA expression is measured using a 
method selected from the group consisting of northern blotting, monoclonal antisera to 
SOC/CRAC and polyclonal antisera to SOC/CRAC. 

32. A kit, comprising a package containing: 

an agent that selectively binds to the isolated nucleic acid of claim 1 or an 
expression product thereof, and 

a control for comparing to a measured value of binding of said agent to said 
isolated nucleic acid of claim 1 or expression product thereof. 

33. The kit of claim 32, wherein the control comprises an epitope of the expression 
product of the nucleic acid of claim 1 . 

34. A pharmaceutical composition comprising: 

a pharmaceutically effective amount of an agent comprising of an isolated 
nucleic acid molecule of claim 1 or an expression product thereof, and 
a pharmaceutically acceptable carrier. 

35. The pharmaceutical composition of claim 34, wherein the agent is an expression 
product of the isolated nucleic acid molecule of claim 1 . 

36. A method for identifying agents useful in the modulation of a SOC/CRAC polypeptide 
kinase activity, comprising: 

a) contacting a SOC/CRAC polypeptide with kinase activity with a candidate agent 
suspected of modulating SOC/CRAC kinase activity, under conditions sufficient to allow the 
candidate agent to interact with the SOC/CRAC polypeptide and modulate its kinase activity; 

b) detecting a kinase activity associated with the SOC/CRAC polypeptide in the 
presence of the candidate agent; and 

c) comparing the kinase activity of step (b) with a control kinase activity of a 
SOC/CRAC polypeptide in the absence of the candidate agent to determine whether the 
candidate agent modulates SOC/CRAC kinase activity. 

37. The method of claim 36, wherein the SOC/CRAC polypeptide comprises 
amino acids 999- 11 80 of the sequence represented as SEQ ID NO:24, or a fragment thereof 
that retains the kinase activity. 
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<110> Beth Israel Deaconess Medical Center, Inc. 
Scharenberg, Andrew 

<120> CHARACTERIZATION OF A CALCIUM CHANNEL FAMILY 

<130> B0662/7026WO/ERP/KA 

<150> U.S. 60/114,220 
<151> 1998-12-30 

<150> U.S. 60/120, 018 
<151> 1999-01-29 

<150> U.S. 60/140,415 
<151> 1999-06-22 

<160> 32 

<170> FastSEQ for Windows Version 3.0 

<210> 1 

<211> 1212 

<212> DNA 

<213> Homo Sapiens 



<400> 1 

gcacgaggca aattttttgt tagtacacca tctcagccaa gttgcaaaag ccacttggaa 60 

actggaacca aagatcaaga aactgtttgc tctaaagcta cagaaggaga taatacagaa 120 

tttggagcat ttgtaggaca cagagatagc atggatttac agaggtttaa agaaacatca 180 

aacaagataa aaatactatc caataacaat acttctgaaa acactttgaa acgagtgagt 240 

tctcttgctg gatttactga ctgtcacaga acttccattc ctgttcattc aaaacaagaa 300 

aaaatcagta gaaggccatc taccgaagac actcatgaag tagattccaa agcagcttta 360 

ataccggttt gtagatttca actaaacaga tatatattat taaatacatt aaactttttt 420 

agataagatc tacaaagtgg tgatatttgg gactatatca aaaattcaaa aaaatttttc 480 

ttaagaaaac tgactttagc atagtagcag ttacagaaaa gtttcttaca gtgaatagtc 540 

aggaatttta aagaaaaatt tatgcagaat aaaggcagga atctcttttt gtttgaattg 600 

aagctaatta tatgaactca tttccagcta actgcgataa tgattgattt tgcaaattcc 660 

ctttaaaagc acacactgac aagacaaaaa gctcaggaaa aggcagaaaa attactcctt 720 

tataatcaag tattatatat aagtcagtgc tcataatttt gctcaagaaa atattgactt 780 

acattcatat atatctgttc tggcatagag agattatgtt gttaaaatca tgttattgaa 840 

aaaagttatt tcagtgggga aagaggttag ttaacaaaga gattcacagt aacaaatcct 900 

cctttctgga gggactcttc ctgaccctga gctgcacaac tttgcaacaa attaaagcct 960 

aaccgaagat gacctcacaa tggcaattta gaactcatgg gagtcaactt acataaacgg 1020 

tatttgattt ctgataagat agtggaatta ttggttatag atgacaaaat aagtatgttt 1080 

aaagtgatga tggacataaa aaagttttaa atataaaaca tgagaaaaga aggagatact 1140 

attcaaaaag actggcaaat ttgaaaaact agaaataaaa aaaaaaaaaa aaaatgagcg 1200 

gccgcaagct tt 1212 



<210> 2 
<211> 141 
<212> PRT 

<213> Homo Sapiens 
<400> 2 

Ala Arg Gly Lys Phe Phe Val Ser Thr Pro Ser Gin Pro Ser Cys Lys 

15 10 15 

Ser His Leu Glu Thr Gly Thr Lys Asp Gin Glu Thr Val Cys Ser Lys 
20 25 30 
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Ala 


Thr 


Glu 


Gly 


Asp 


Asn 


Thr 


Glu 


Phe 


Gly Ala 


Phe 


Val 


Gly 


His 


Arg 






35 










H Q 










45 








Asp 


Ser 
50 


Met 


Asp 


Leu 


Gin 


Arg 
55 


Phe 


Lys 


Glu 


Thr 


Ser 
60 


Asn 


Lys 


He 


Lys 


lie 


Leu 


Ser 


Asn 


Asn 


Asn 


Thr 


Ser 


Glu 


Asn 


Thr 


Leu 


Lys 


Arg 


Val 


Ser 


65 










70 










75 










80 


Ser 


Leu 


Ala 


Gly 


Phe 
85 


Thr 


Asp 


Cys 


His 


Arg 
90 


Thr 


Ser 


He 


Pro 


Val 
95 


His 


Ser 


Lys 


Gin 


Glu 
100 


Lys 


lie 


Ser 


Arg 


Arg 
105 


Pro 


Ser 


Thr 


Glu 


Asp 
110 


Thr 


His 


Glu 


val 


Asp 
115 


Ser 


Lys 


Ala 


Ala 


Leu 
120 


He 


Pro 


Val 


Cys 


Arg 
125 


Phe 


Gin 


Leu 


Asn 


Arg 


Tyr 


He 


Leu 


Leu 


Asn 


Thr 


Leu 


Asn 


Phe 


Phe 


Arg 









130 135 140 



<210> 3 

<211> 739 

<212> DNA 

<213> Homo Sapiens 

<220> 

<221> unsure 
<222> (5)... (5) 
<223> UNKNOWN 

<221> unsure 
<222> <21) . . . (22) 
<223> UNKNOWN 

<221> unsure 
<222> (29) . . . (29) 
<223> UNKNOWN 



<400> 3 

tcgantaggg gtcttccacc nncatactng gatgatggtt ggtgaagtct atgcatacga 60 

aattgatgtg tgtgcaaacg attctgttat ccctcaaatc tgtggtcctg ggacgtggtt 120 

gactccattt cttcaagcag tctacctctt tgwacagtat atcattatgg ttaatcttct 180 

tattgcattt ytcaacaatg tgtatttaca agtgaaggca atttccaata ttgyatggaa 240 

gtaccagcgt tatcatttta ttatggctta tcatgagaaa ccagttctgc ctcctccact 300 

tatcattctt agccatatag tttctctgtt ttgctgcata tgtaagagaa gaaagaaaga 360 

taagacttcc gatggaccaa aacttttctt aacagaagaa gatcaaaaga aacttcatga 420 

ttttgaagag cagtgtgttg aaatgtattt caatgaaaaa gatgacaaat ttcattctgg 480 

gagtgaagag agaattcgtg tcacttttga aagagtggaa cagatgtgca ttcagattaa 540 

agaagttgga gatccgtgtc aactacataa aaagatcatt acaatcatta gattctcaaa 600 

ttggccattt gcaagatctt tcagccctga cggtagatac attaaaaaca ctcactggcc 660 

aaaagcgtcg gaagctagca aagttcataa tgaaatcaca cgagaactga gcatttccaa 720 

acacttggct caaaacctt 739 



<210> 4 
<211> 235 
<212> PRT 

<213> Homo Sapiens 

<220> 

<221> UNSURE 
<222> (41) . . . (41) 
<223> UNKNOWN 

<221> UNSURE 
<222> (54) . . . (54) 
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<223> UNKNOWN 

<221> UNSURE 
<222> (68) . . . (68) 
<223> UNKNOWN 



<400> 4 



Met 


Met 


Val 


Gly 


Glu 


Val 


Tyr 


Ala 


Tyr 


Glu 


He 


Asp 


Val 


Cys 


Ala 


Asn 


1 








5 










10 










15 




Asp 


Ser 


Val 


He 
20 


Pro 


Gin 


He 


Cys 


Gly 
25 


Pro 


Gly 


Thr 


Trp 


Leu 
30 


Thr 


Pro 


Phe 


Leu 


Gin 
35 


Ala 


Val 


Tyr 


Leu 


Phe 
40 


Xaa 


Gin 


Tyr 


He 


He 
45 


Met 


Val 


Asn 


Leu 


Leu 
50 


lie 


Ala 


Phe 


Xaa 


Asn 
55 


Asn 


Val 


Tyr 


Leu 


Gin 
60 


Val 


Lys 


Ala 


He 


Ser 


Asn 


He 


Xaa 


Trp 


Lys 


Tyr 


Gin Arg 


Tyr 


His 


Phe 


He 


Met 


Ala 


Tyr 


65 










70 










75 










80 


His 


Glu 


Lys 


Pro 


Val 


Leu 


Pro 


Pro 


Pro 


Leu 


He 


He 


Leu 


Ser 


His 


He 








85 










90 










95 




Val 


Ser 


Leu 


Phe 
100 


Cys 


Cys 


He 


Cys 


Lys 
105 


Arg 


Arg 


Lys 


Lys 


Asp 
110 


Lys 


Thr 


Ser 


Asp 


Gly 
115 


Pro 


Lys 


Leu 


Phe 


Leu 
120 


Thr 


Glu 


Glu 


Asp 


Gin 
125 


Lys 


Lys 


Leu 


His 


Asp 
130 


Phe 


Glu 


Glu 


Gin 


Cys 
135 


Val 


Glu 


Met 


Tyr 


Phe 
140 


Asn 


Glu 


Lys 


Asp 


Asp 


Lys 


Phe 


His 


Ser 


Gly 


Ser 


Glu 


Glu 


Arg 


He 


Arg 


Val 


Thr 


Phe 


Glu 


145 










150 










155 










160 


Arg 


Val 


Glu 


Gin 


Met 


Cys 


He 


Gin 


He 


Lys 


Glu 


Val 


Gly Asp 


Pro 


Cys 








165 










170 










175 




Gin 


Leu 


His 


Lys 
180 


Lys 


He 


He 


Thr 


He 
185 


He 


Arg 


Phe 


Ser 


Asn 
190 


Trp 


Pro 


Phe 


Ala 


Arg 


Ser 


Phe 


Ser 


Pro 


Asp 


Gly Arg 


Tyr 


He 


Lys 


Asn 


Thr 


His 






195 










200 










205 








Trp 


Pro 
210 


Lys 


Ala 


Ser 


Glu 


Ala 
215 


Ser 


Lys 


Val 


His 


Asn 
220 


Glu 


He 


Thr 


Arg 


Glu 


Leu 


Ser 


He 


Ser 


Lys 


His 


Leu 


Ala 


Gin 


Asn 












225 










230 










235 













<210> 5 

<211> 1579 

<212> DNA 

<213> Homo Sapiens 

<220> 

<221> unsure 

<222> (368) . . . (368) 

<223> g or c 

<221> unsure 

<222> (372) . . . (372) 

<223> g or c 

<221> unsure 

<222> (374) . . . (374) 

<223> g or a 

<221> unsure 

<222> (375) . . . (375) 

<223> g or c 
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<221> unsure 

<222> (387) . . . (387) 

<221> unsure 

<222> (482) . . . (482) 



<400> 5 

acgtcgcctg caggtaccgg tccggaattc ccgggtcgac ccacgcgtcc ggcatggtgt 60 

tgtaaataca cttagctcct ctcttcctca aggtgatctt gaaagtaata atccttttca 120 

ttgtaatatt ttaatgaaag atgacaaaga tccccagtgt aatatatttg gtcaagactt 180 

acctgcagta ccccagagaa aagaatttaa ttttccagag gctggttcct cttctggtgc 240 

cttattccca agtgctgttt cccctccaga actgcgacag agactacatg gggtagaact 300 

cttaaaaata tttaataaaa atcaaaaatt aggcagttca tctactagca taccacatct 360 

gtcatccsca csarscaaat tttttgntag tacaccatct cagccaagtt gcaaaagcca 420 

cttggaaact ggaaccaaag atcaagaaac tgtttgctct aaagctacag aaggagataa 4 80 

tncagaattt ggagcatttg taggacacag agatagcatg gatttacaga ggtttaaaga 540 

aacatcaaac aagataaaaa tactatccaa taacaatact tctgaaaaca ctttgaaacg 600 

agtgagttct cttgctggat ttactgactg tcacagaact tccattcctg ttcattcaaa 660 

acaagaaaaa atcagtagaa ggccatctac cgaagacact catgaagtag attccaaagc 720 

agctttaata ccggtttgta gatttcaact aaacagatat atattattaa atacattaaa 780 

cttttttaga taagatctac aaagtggtga tatttgggac tatatcaaaa attcaaaaaa 84 0 

atttttctta agaaaactga ctttagcata gtagcagtta cagaaaagtt tcttacagtg 900 

aatagtcagg aattttaaag aaaaatttat gcagaataaa ggcaggaatc tctttttgtt 960 

tgaattgaag ctaattatat gaactcattt ccagctaact gcgataatga ttgattttgc 1020 

aaattccctt taaaagcaca cactgacaag acaaaaagct caggaaaagg cagaaaaatt 1080 

actcctttat aatcaagtat tatatataag tcagtgctca taattttgct caagaaaata 1140 

ttgacttaca ttcatatata tctgttctgg catagagaga ttatgttgtt aaaatcatgt 1200 

tattgaaaaa agttatttca gtggggaaag aggttagtta acaaagagat tcacagtaac 1260 

aaatcctcct ttctggaggg actcttcctg accctgagct gcacaacttt gcaacaaatt 1320 

aaagcctaac cgaagatgac ctcacaatgg caatttagaa ctcatgggag tcaacttaca 1380 

taaacggtat ttgatttctg ataagatagt ggaattattg gttatagatg acaaaataag 1440 

tatgtttaaa gtgatgatgg acataaaaaa gttttaaata taaaacatga gaaaagaagg 1500 

agatactatt caaaaagact ggcaaatttg aaaaactaga aataaaaaaa aaaaaaaaaa 1560 

atgagcggcc gcaagcttt 1579 



<210> 6 

<211> 243 

<212> PRT 

<213> Homo Sapiens 

<220> 

<221> UNSURE 

<222> (103) - . . (105) 

<223> UNKNOWN 

<221> UNSURE 

<222> (109) . . . (109) 

<223> UNBCNOWN 

<221> UNSURE 

<222> (141) . . . (141) 

<223> UNKNOWN 

<400> 6 

Val Asn Thr Leu Ser Ser Ser Leu Pro Gin Gly Asp Leu Glu Ser Asn 

1-5 10 15 

Asn Pro Phe His Cys Asn lie Leu Met Lys Asp Asp Lys Asp Pro Gin 

20 25 30 

Cys Asn lie Phe Gly Gin Asp Leu Pro Ala Val Pro Gin Arg Lys Glu 
35 40 45 
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Phe 


Asn 
50 


Phe 


Pro 


Glu 


Ala 


Gly 
55 


Ser 


Ser 


Ser 


Gly 


Ala 
60 


Leu 


Phe 


Pro 


Ser 


Ala 


Val 


Ser 


Pro 


Pro 


Glu 


Leu 


Arg 


Gin 


Arg 


Leu 


His 


Gly 


Val 


Glu 


Leu 


65 










70 










75 










80 


Leu 


Lys 


He 


Phe 


Asn 


Lys 


Asn 


Gin 


Lys 


Leu 


Gly 


Ser 


Ser 


Ser 


Thr 


Ser 








85 










90 










95 




He 


Pro 


His 


Leu 
100 


Ser 


Ser 


Xaa 


Xaa 


Xaa 
105 


Lys 


Phe 


Phe 


Xaa 


Ser 
110 


Thr 


Pro 


Ser 


Gin 


Pro 
115 


Ser 


Cys 


Lys 


Ser 


His 
120 


Leu 


Glu 


Thr 


Gly 


Thr 
125 


Lys 


Asp 


Gin 


Glu 


Thr 
130 


Val 


Cys 


Ser 


Lys 


Ala 
135 


Thr 


Glu 


Gly 


Asp 


Asn 
140 


Xaa 


Glu 


Phe 


Gly 


Ala 


Phe 


Val 


Gly 


His 


Arg 


Asp 


Ser 


Met 


Asp 


Leu 


Gin 


Arg 


Phe 


Lys 


Glu 


145 










150 










155 










160 


Thr 


Ser 


Asn 


Lys 


He 
165 


Lys 


He 


Leu 


Ser 


Asn 
170 


Asn 


Asn 


Thr 


Ser 


Glu 
175 


Asn 


Thr 


Leu 


Lys 


Arg 
180 


Val 


Ser 


Ser 


Leu 


Ala 
185 


Gly 


Phe 


Thr 


Asp 


Cys 
190 


His 


Arg 


Thr 


Ser 


He 
195 


Pro 


Val 


His 


Ser 


Lys 
200 


Gin 


Glu 


Lys 


He 


Ser 
205 


Arg 


Arg 


Pro 


Ser 


Thr 
210 


Glu 


Asp 


Thr 


His 


Glu 
215 


Val 


Asp 


Ser 


Lys 


Ala 
220 


Ala 


Leu 


He 


Pro 


Val 


Cys 


Arg 


Phe 


Gin 


Leu 


Asn 


Arg 


Tyr 


He 


Leu 


Leu 


Asn 


Thr 


Leu 


Asn 


225 










230 










235 










240 



Phe Phe Arg 



<210> 7 

<211> 3532 

<212> DNA 

<213> Mus Musculus 

<220> 

<221> unsure 

<222> (2420) . . . (2420) 

<223> unknown 

<221> unsure 

<222> (2434) . . . (2434) 

<223> unknown 

<221> unsure 

<222> (2461) . . . (2461) 

<223> unknown 

<221> unsure 

<222> (2466) . . . (2466) 

<223> unknown 

<221> unsure 

<222> (2470) . . . (2470) 

<223> unknown 

<400> 7 

attatggctt atcatgaaaa accagtcctg cctcctcctc ttatcatcct cagccatata 60 

gtttcactgt tttgctgtgt atgcaaaaga agaaagaaag ataagacttc cgatgggcca 120 

aaacttttct taacagaaga agatcaaaag aaactccatg attttgaaga gcagtgtgtt 180 

gagatgtact ttgatgagaa agatgacaaa ttcaattctg ggagtgaaga gagaatccgg 24 0 

gtcacttttg aaagagtgga gcagatgagc attcagatta aagaagttgg agatcgtgtc 300 

aactacataa aaagatcatt acagtcttta gattctcaaa ttggtcatct gcaagatctc 360 
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tcagccctaa cagtagatac attgaaaaca cttacagccc agaaagcttc agaagctagt 420 

aaagtgcaca atgagatcac acgagaattg agtatttcca aacacttggc tcagaatctt 480 

attgatgatg ttcctgtaag acctttgtgg gaagaaccta gtgctgtaaa cacactgagt 540 

tcctctcttc ctcaaggtga tcgggaaagt aataatcctt ttctttgtaa tatttttatg 600 

aaagatgaaa aagaccccca atataatctg tttggacaag atttgcccgt gataccccag 660 

agaaaagaat tcaacattcc agaggctggt tcctcctgtg gtgccttatt cccaagtgct 720 

gtttctcccc cagaattacg acagagacga catggggtag aaatgttaaa aatatttaat 780 

aaaaatcaaa aattaggcag ttcacctaat agttcaccac atatgtcctc cccaccaacc 840 

aaattttctg tgagtacccc atcccagcca agttgcaaaa gtcacttgga atccacaacc 900 

aaagatcaag aacccatttt ctataaagct gcagaagggg ataacataga atttggagca 960 

tttgtgggac acagagatag tatggactta cagaggttta aagaaacatc aaacaaaata 1020 

agagaactgt tatctaatga tactcctgaa aacactctga aacatgtggg tgctgctgga 1080 

tatagtgaat gttgtaagac ttctacttct cttcactcgg tgcaagcaga aagctgtagt 1140 

agaagagcgt cgacggaaga ctctccagaa gtcgattcta aagcagcttt gttaccggat 1200 

tggttacgag atagaccatc aaacagagaa atgccatctg aaggaggaac attaaatggt 1260 

cttgcttctc catttaagcc cgttttggat acaaattact attattcagc tgtggaaaga 1320 

aataacctga tgaggttgtc acagagtatt cccttcgttc ctgtacctcc acgaggcgag 1380 

cctgtcacag tgtaccgtct ggaggagagt tctcccagta tactgaataa cagcatgtct 14 40 

tcatggtctc agctaggcct ctgtgccaaa attgagtttt taagtaaaga ggaaatggaa 1500 

ggtggtttac gaagagcagt caaagtgctg tgtacctggt cagagcacga tatcctgaag 1560 

tcagggcatc tctatatcat taagtcattt cttcctgagg tgataaacac atggtcaagc 1620 

atttataaag aagatacggt tctacatctc tgtctcagag aaatacaaca acagagagca 1680 

gcacaaaagc tcacatttgc ctttaatcag atgaaaccca aatccatacc atattctcca 1740 

aggttccttg aagttttcct gttgtactgc cattcagcag ggcagtggtt tgctgtagaa 1800 

gagtgcatga ctggtgaatt tagaaaatac aacaacaata atggtgatga aatcattcct 18 60 

acaaatactc tagaagagat catgctagcc tttagccact ggacctatga atataccaga 1920 

ggggagttac tggtacttga cttacaagga gtgggagaaa acttgactga cccatctgta 1980 

ataaaagctg aagaaaaaag atcctgtgac atggttitttg gccctgccaa tctaggagaa 2040 

gatgcaataa aaaacttcaa gagccaaaca tccactgtaa ttcttgctgt cgaaagctta 2100 

aacttcccag atttgaagag gaatgactac acgcccttga taaaattata tttcctcagg 2160 

atgagtcatc agatttgaat cttcaatctg gaaattccac caaagaatca gaagcaacaa 2220 

attctgttcg tctgatgtta tagtgctgag tcattggttt ttgcctacac ttcacaaaag 2280 

tgtaactgtc agttttcctt tcgggggaat tgatgatata ggaagatgtg tgcaaaatga 2340 

gcttgctggc cccacacata gtctagaggt aatgttctca ttgaaaaacg cctggaggtg 24 00 

gaggctgcag atgccagtgn aaagtgctag ctgncagaga gtcagtgctc tcgggctggt 24 60 

naaggncggn acccttgctg ctgagagtgg tggttctctt cacctggtgc aggaccatta 2520 

accaaagtca agtcttcaga tttgattggc tgctcagtca cagcccattc agctaaggaa 2580 

actaaattgc gcagcttttt aaatggctga agtcttcctc agtttgtgct ctatgataat 264 0 

gatgttagct ctcaactagg tgtttgtggc cacgggagaa ctactcctta caattttgct 2700 

tcacaggcat gttacaaagc ctgcactgaa aaccgtttgt cttccctctc tccctccctc 2760 

ttttccctgt agtattgagg atcaaaccca gggcctcatg aagaccattt tctaagagac 2820 

attttattta agaatcaact atagagtcta tgtttatgga tacagccagt ttttgttaaa 2880 

caaaacctga attgtgcaaa agggtttttt aacatttatc aatgttaagt aaaagaaagc 2940 

catgataaat aagaattaac tcactgttca atgggtgttt cctgtgagga aggttacagt 3000 

tgtaacagcc tgcagttgca tacatctcca aagatttaca gacttagtgt atcaaatcag 3060 

agtgtcatgt gagctctcac attgaaaatt ctataggaat gtgtcaatgt gaattctatt 3120 

tctggtactt aagaaatcag ttgttggatt atccttatac agtataggga gatcacaata 3180 

caactttatg ccaataaaat ctaacttaat tgcccagata tttttgcata tttagcaaca 3240 

agaaaagctt atcatttgac tcaagtttta tgctttctct ttcttttcat ttcctaggta 3300 

ctaattttaa tttttatttg gaaggagcag tgtaaagctt acttgtattc aatagtgtat 3360 

ctcatagata cagacaaggc cgcagagata agctgttaaa tagtgtttaa tgttgatgtg 3420 

gagagaaagg tgtattactt aaaaatacta taccatatac gttttgtata tcattaaatc 3480 

tttaaaagaa attaaattta ttcttgttta aaaaaaaaaa aaaaaaaaaa aa 3532 

<210> 8 

<211> 475 

<212> PRT 

<213> Mus Musculus 



<400> 8 
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He 


Met 


Ala 


Tyr 


His 


Glu 


Lys 


Pro 


Val 


Leu 


Pro 


Pro 


Pro 


Leu 


He 


He 


1 








5 










10 










15 




Leu 


Ser 


His 


He 


Val 


Ser 


Leu 


Phe 


Cys 


Cys 


Val 


Cys 


Lys 


Arg 


Arg 


Lys 








20 










25 










30 






Lys 


Asp 


Lys 


Thr 


Ser 


Asp Gly 


Pro 


Lys 


Leu 


Phe 


Leu 


Thr 


Glu 


Glu 


Asp 






35 










40 










45 








Gin 


Lys 


Lys 


Leu 


His 


Asp 


Phe 


Glu 


Glu 


Gin 


Cys 


Val 


Glu 


Met 


Tyr 


Phe 




50 










55 










60 










Asp 


Glu 


Lys 


Asp 


Asp 


Lys 


Phe 


Asn 


Ser 


Gly 


Ser 


Glu 


Glu 


Arg 


He 


Arg 


65 










70 










75 










80 


Val 


Thr 


Phe 


Glu 


Arg 


Val 


Glu 


Gin 


Met 


Ser 


He 


Gin 


He 


Lys 


Glu 


Val 










85 










90 










95 




Gly 


Asp 


Arg 


Val 


Asn Tyr 


He 


Lys 


Arg 


Ser 


Leu 


Gin 


Ser 


Leu 


Asp 


Ser 








100 










105 










110 






Gin 


He 


Gly 


His 


Leu 


Gin 


Asp 


Leu 


Ser 


Ala 


Leu 


Thr 


Val 


Asp 


Thr 


Leu 






115 










120 










125 








Lys 


Thr 


Leu 


Thr 


Ala 


Gin 


Lys 


Ala 


Ser 


Glu 


Ala 


Ser 


Lys 


Val 


His 


Asn 




130 










135 










140 










Glu 


He 


Thr 


Arg 


Glu 


Leu 


Ser 


He 


Ser 


Lys 


His 


Leu 


Ala 


Gin 


Asn 


Leu 


145 










150 










155 










160 


He 


Asp 


Asp 


Val 


Pro 


Val 


Arg 


Pro 


Leu 


Trp 


Glu 


Glu 


Pro 


Ser 


Ala 


Val 










165 










170 










175 




Asn 


Thr 


Leu 


Ser 


Ser 


Ser 


Leu 


Pro 


Gin 


Gly Asp Arg 


Glu 


Ser 


Asn 


Asn 








180 










185 










190 






Pro 


Phe 


Leu 


Cys 


Asn 


He 


Phe 


Met 


Lys 


Asp 


Glu 


Lys 


Asp 


Pro 


Gin 


Tyr 






195 










200 










205 








Asn 


Leu 


Phe 


Gly 


Gin 


Asp 


Leu 


Pro 


Val 


He 


Pro 


Gin 


Arg 


Lys 


Glu 


Phe 




210 










215 










220 










Asn 


He 


Pro 


Glu 


Ala 


Gly 


Ser 


Ser 


Cys 


Gly Ala 


Leu 


Phe 


Pro 


Ser 


Ala 


225 










230 










235 










240 


Val 


Ser 


Pro 


Pro 


Glu 


Leu 


Arg 


Gin 


Arg 


Arg 


His 


Gly 


Val 


Glu 


Met 


Leu 










245 










250 










255 




Lys 


He 


Phe 


Asn 


Lys 


Asn 


Gin 


Lys 


Leu 


Gly 


Ser 


Ser 


Pro 


Asn 


Ser 


Ser 








260 










265 










270 






Pro 


His 


Met 


Ser 


Ser 


Pro 


Pro 


Thr 


Lys 


Phe 


Ser 


Val 


Ser 


Thr 


Pro 


Ser 






275 










280 










285 








Gin 


Pro 


Ser 


Cys 


Lys 


Ser 


His 


Leu 


Glu 


Ser 


Thr 


Thr 


Lys 


Asp 


Gin 


Glu 




290 










295 










300 










Pro 


He 


Phe 


Tyr 


Lys 


Ala 


Ala 


Glu 


Gly 


Asp 


Asn 


He 


Glu 


Phe 


Gly 


Ala 


305 










310 










315 










320 


Phe 


Val 


Gly 


His 


Arg 


Asp 


Ser 


Met 


Asp 


Leu 


Gin 


Arg 


Phe 


Lys 


Glu 


Thr 










325 










330 










335 




Ser 


Asn 


Lys 


He 


Arg 


Glu 


Leu 


Leu 


Ser 


Asn 


Asp 


Thr 


Pro 


Glu 


Asn 


Thr 








340 










345 










350 






Leu 


Lys 


His 


Val 


Gly 


Ala 


Ala 


Gly 


Tyr 


Ser 


Glu 


Cys 


Cys 


Lys 


Thr 


Ser 






355 










360 










365 








Thr 


Ser 


Leu 


His 


Ser 


Val 


Gin 


Ala 


Glu 


Ser 


Cys 


Ser 


Arg 


Arg 


Ala 


Ser 




370 










375 










380 










Thr 


Glu 


Asp 


Ser 


Pro 


Glu 


Val 


Asp 


Ser 


Lys 


Ala 


Ala 


Leu 


Leu 


Pro 


Asp 


385 










390 










395 












Trp 


Leu 


Arg 


Asp 


Arg 


Pro 


Ser 


Asn 


Arg 


Glu 


Met 


Pro 


Ser 


Glu 


Gly 


Gly 










405 










410 










415 




Thr 


Leu 


Asn 


Gly 


Leu 


Ala 


Ser 


Pro 


Phe 


Lys 


Pro 


Val 


Leu 


Asp 


Thr 


Asn 








420 










425 










430 






Tyr 


Tyr 


Tyr 


Ser 


Ala 


Val 


Glu Arg Asn 


Asn 


Leu 


Met 


Arg 


Leu 


Ser 


Gin 






435 










440 










445 








Ser 


He 


Pro 


Phe 


Val 


Pro 


Val 


Pro 


Pro 


Arg Gly Glu 


Pro 


Val 


Thr 


Val 




4 50 










455 










4 60 










Tyr 


Pro 


Ser 


Gly Gly Arg 


Val 


Leu 


Pro 


Val 


Tyr 












4 65 










470 










475 
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<210> 9 

<211> 5433 

<212> DNA 

<213> Mus Musculus 

<220> 

<221> unsure 

<222> (5094) . . . (5094) 

<223> unknown 



<400> 9 

ggctgaaaga gcctgagctg tgcctctcca ttccactgct gtggcagggt cagaaatctt 60 

ggatagagaa aaccttttgc aaacgggaat gtatctttgt aattcctagc acgaaagact 120 

ctaacaggtg ttgctgtggc cagttcacca accagcatat cccccctctg ccaagtgcaa 180 

cacccagcaa aaatgaagag gaaagcaaac aggtggagac tcagcctgag aaatggtctg 24 0 

ttgccaagca cacccagagc tacccaacag attcctatgg agttcttgaa ttccagggtg 300 

gcggatattc caataaagcc atgtatatcc gtgtatccta tgacaccaag ccagactcac 360 

tgctccatct catggtgaaa gattggcagc tggaactccc caagctctta atatctgtgc 4 20 

atggaggcct ccagaacttt gagatgcagc ccaagctgaa acaagtcttt gggaaaggcc 4 80 

tgatcaaggc tgctatgacc accggggcct ggatcttcac cgggggtgtc agcacaggtg 54 0 

ttatcagcca cgtaggggat gccttgaaag accactcctc caagtccaga ggccgggttt 600 

gtgctatagg aattgctcca tggggcatcg tggagaataa ggaagacctg gttggaaagg 660 

atgtaacaag agtgtaccag accatgtcca accctctaag taagctctct gtgctcaaca 720 

actcccacac ccacttcatc ctggctgaca atggcaccct gggcaagtat ggcgccgagg 780 

tgaagctgcg aaggctgctg gaaaagcaca tctccctcca gaagatcaac acaagactgg 84 0 

ggcagggcgt gcccctcgtg ggtctcgtgg tggagggggg ccctaacgtg gtgtccatcg 900 

tcttggaata cctgcaagaa gagcctccca tccctgtggt gatttgtgat ggcagcggac 960 

gtgcctcgga catcctgtcc tttgcgcaca agtactgtga agaaggcgga ataataaatg 1020 

agtccctcag ggagcagctt ctagttacca ttcagaaaac atttaattat aataaggcac 1080 

aatcacatca gctgtttgca attataatgg agtgcatgaa gaagaaagaa ctcgtcactg 1140 

tgttcagaat gggttctgag ggccagcagg acatcgagat ggcaatttta actgccctgc 1200 

tgaaaggaac aaacgtatct gctccagatc agctgagctt ggcactggct tggaaccgcg 1260 

tggacatagc acgaagccag atctttgtct ttgggcccca ctggacgccc ctgggaagcc 1320 

tggcaccccc gacggacagc aaagccacgg agaaggagaa gaagccaccc atggccacca 1380 

ccaagggagg aagaggaaaa gggaaaggca agaagaaagg gaaagtgaaa gaggaagtgg 144 0 

aggaagaaac tgacccccgg aagatagagc tgctgaactg ggtgaatgct ttggagcaag 1500 

cgatgctaga tgctttagtc ttagatcgtg tcgactttgt gaagctcctg attgaaaacg 1560 

gagtgaacat gcaacacttt ctgaccattc cgaggctgga ggagctctat aacacaagac 1620 

tgggtccacc aaacacactt catctgctgg tgagggatgt gaaaaagagc aaccttccgc 1680 

ctgattacca catcagcctc atagacatcg ggctcgtgct ggagtacctc atgggaggag 174 0 

cctaccgctg caactacact cggaaaaact ttcggaccct ttacaacaac ttgtttggac 1800 

caaagaggcc taaagctctt aaacttctgg gaatggaaga tgatgagcct ccagctaaag 1860 

ggaagaaaaa aaaaaaaaag aaaaaggagg aagagatcga cattgatgtg gacgaccctg 1920 

ccgtgagtcg gttccagtat cccttccacg agctgatggt gtgggcagtg ctgatgaaac 1980 

gccagaaaat ggcagtgttc ctctggcagc gaggggaaga gagcatggcc aaggccctgg 2040 

tggcctgcaa gctctacaag gccatggccc acgagtcctc cgagagtgat ctggtggatg 2100 

acatctccca ggacttggat aacaattcca aagacttcgg ccagcttgct ttggagttat 2160 

tagaccagtc ctataagcat gacgagcaga tcgctatgaa actcctgacc tacgagctga 2220 

aaaactggag caactcgacc tgcctcaaac tggccgtggc agccaaacac cgggacttca 2280 

ttgctcacac ctgcagccag atgctgctga ccgatatgtg gatgggaaga ctgcggatgc 234 0 

ggaagaaccc cggcctgaag gttatcatgg ggattcttct accccccacc atcttgtttt 24 00 

tggaatttcg cacatatgat gatttctcgt atcaaacatc caaggaaaac gaggatggca 24 60 

aagaaaaaga agaggaaaat acggatgcaa atgcagatgc tggctcaaga aagggggatg 2520 

aggagaacga gcataaaaaa cagagaagta ttcccatcgg aacaaagatc tgtgaattct 2580 

ataacgcgcc cattgtcaag ttctggtttt acacaatatc atacttgggc tacctgctgc 2 640 

tgtttaacta cgtcatcctg gtgcggatgg atggctggcc gtccctccag gagtggatcg 2700 

tcatctccta catcgtgagc ctggcgttag agaagatacg agagatcctc atgtcagaac 2760 

caggcaaact cagccagaaa atcaaagttt ggcttcagga gtactggaac aticacagatc 2820 

tcgtggccat ttccacattc atgattggag caattcttcg cctacagaac cagccctaca 2880 
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tgggctatgg ccgggtgatc tactgtgtgg atatcatctt ctggtacatc cgtgtcctgg 2940 

acatctttgg tgtcaacaag tatctggggc catacgtgat gatgattgga aagatgatga 3000 

tcgacatgct gtactttgtg gtcatcatgc tggtcgtgct catgagtttc ggagtagccc 3060 

gtcaagccat tctgcatcca gaggagaagc cctcttggaa actggcccga aacatcttct 3120 

acatgcccta ctggatgatc tatggagagg tgtttgcaga ccagatagac ctctacgcca 3180 

tggaaattaa tcctccttgt ggtgagaacc tatatgatga ggagggcaag cggcttcctc 324 0 

cctgtatccc cggcgcctgg ctcactccag cactcatggc gtgctatcta ctggtcgcca 3300 

acatcctgct ggtgaacctg ctgattgctg tgttcaacaa tactttcttt gaagtaaaat 3360 

caatatccaa ccaggtgtgg aagttccagc gatatcagct gattatgaca tttcatgaca 3420 

ggccagtcct gcccccaccg atgatcattt taagccacat ctacatcatc attatgcgtc 3480 

tcagcggccg ctgcaggaaa aagagagaag gggaccaaga ggaacgggat cgtggattga 354 0 

agctcttcct tagcgacgag gagctaaaga ggctgcatga gttcgaggag cagtgcgtgc 3600 

aggagcactt ccgggagaag gaggatgagc agcagtcgtc cagcgacgag cgcatccggg 3660 

tcacttctga aagagttgaa aatatgtcaa tgaggttgga agaaatcaat gaaagagaaa 3720 

cttttatgaa aacttccctg cagactgttg accttcgact tgctcagcta gaagaattat 3780 

ctaacagaat ggtgaatgct cttgaaaatc ttgcgggaat cgacaggtct gacccgatcc 384 0 

aggcacggtc ccgggcttct tctgaatgtg aggcaacgta tcttctccgg caaagcagca 3900 

.tcaatagcgc tgatggctac agcttgtatc gatatcattt taacggagaa gagttattat 3960 

ttgaggatac atctctctcc acgtcaccag ggacaggagt caggaaaaaa acctgttcct 4020 

tccgtataaa ggaagagaag gacgtgaaaa cgcacctagt cccagaatgt cagaacagtc 4080 

ttcacctttc actgggcaca agcacatcag caaccccaga tggcagtcac cttgcagtag 4140 

atgacttaaa gaacgctgaa gagtcaaaat taggtccaga tattgggatt tcaaaggaag 4200 

atgatgaaag acagacagac tctaaaaaag aagaaactat ttccccaagt ttaaataaaa 4260 

cagatgtgat acatggacag gacaaatcag atgttcaaaa cactcagcta acagtggaaa 4320 

cgacaaatat agaaggcact atttcctatc ccctiggaaga aaccaaaatt acacgctatt 4380 

tccccgatga aacgatcaat gcttgtaaaa caatgaagtc cagaagcttc gtctattccc 444 0 

ggggaagaaa gctggtcggt ggggttaacc aggatgtaga gtacagttca atcacggacc 4500 

agcaattgac gacggaatgg caatgccaag ttcaaaagat cacgcgctct catagcacag 4560 

atattcctta cattgtgtcg gaagctgcag tgcaagctga gcaaaaagag cagtttgcag 4 620 

atatgcaaga tgaacaccat gtcgctgaag caattcctcg aatccctcgc ttgtccctaa 4 680 

ccattactga cagaaatggg atggaaaact tactgtctgt gaagccagat caaactttgg 4740 

gattcccatc tctcaggtca aaaagtttac atggacatcc taggaatgtg aaatccattc 4800 

agggaaagtt agacagatct ggacatgcca gtagtgtaag cagcttagta attgtgtctg 4860 

gaatgacagc agaagaaaaa aaggttaaga aagagaaagc ttccacagaa actgaatgct 4 920 

agtctgtttt gtttctttaa tttttttttt taacagtcag aaacccacta atgggtgtca 4 980 

tcttggccca tcctaaacac atmtccaatt tcctaaaaac attttccctt aaaaaatttt 5040 

ggaaattcag acttgattta caatttaatg cactaaaagt agtattttgt tagnatatgt 5100 

tagtaggctt agttttttca gttgcagtag tatcaaatga aagtgatgat actgtaacga 5160 

agataaattg gctaatcagt atacaagatt atacaatctc tttattactg agggccacca 5220 

aatagcctag gaagtgccct cgagcactga agtcaccatt aggtcactca agaagtaagc 5280 

aactagctgg gcacagtggc tcatgcctgt aatcctagca ctttgggagg ccaaggcaga 534 0 

aagatagctt gagtccagga gtttgagacc agcctgggca acatagtgat accccatctc . 54 00 

ttaaaaaaaa aaaaaaaaaa ctgccctcgt gcc 5433 



<210> 10 

<211> 1533 

<212> PRT 

<213> Mus Musculus 

<400> 10 

Met Tyr He Arg Val Ser Tyr Asp Thr Lys Pro Asp Ser Leu Leu His 

15 10 15 

Leu Met Val Lys Asp Trp Gin Leu Glu Leu Pro Lys Leu Leu He Ser 

20 25 30 

Val His Gly Gly Leu Gin Asn Phe Glu Met Gin Pro Lys Leu Lys Gin 

35 40 45 

Val Phe Gly Lys Gly Leu He Lys Ala Ala Met Thr Thr Gly Ala Trp 
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lie 


Phe 


Thr 


Gly 


Gly 


Val 


Ser 


Thr 


oly 


Val 


He 


Ser 


His 


vai 


Gly Asp 


65 










70 










75 










80 


Ala 


Leu 


Lys 


Asp 


His 


Ser 


Ser 


Lys 


Ser 


Arg 


Gly Arg 


val 


Cys 


Ala 


He 










85 










on 










95 




Gly 


lie 


Ala 


Pro 


Trp 


Gly 


He 


Val 


Glu 


Asn 


Lys 


Glu 


Asp 


T Ami 

Leu 


Val 


Gly 








100 










105 










Tin 
110 






Lys 


Asp 


Val 


Thr 


Arg 


Val 


Tyr 


Gin 


Thr 


Met 


Ser 


Asn 


Pro 


Leu 


Ser 


Lys 






115 










120 


















Leu 


Ser 


Val 


Leu 


Asn 


Asn 


Ser 


His 


Thr 


His 


Phe 


He 


Leu 


Ala 


Asp Asn 




130 










135 










140 










Gly 


Thr 


Leu 


Gly 


Lys 


Tyr 


Gly Ala 


Glu 


Val 


Lys 


Leu 


Arg 


Arg 


Leu 


Leu 


145 










150 










155 










160 


Glu 


Lys 


His 


lie 


Ser 


Leu 


Gin 


Lys 


He 


Asn 


Thr 


Arg 


Leu 


Gly 


Gin 


Gly 








165 










170 










175 




Val 


Pro 


Leu 


Val 


Gly 


Leu 


Val 


Val 


Glu 


Gly 


Gly 


Pro 


Asn 


Val 


Val 


Ser 








180 










185 










190 






lie 


Val 


Leu 


Glu 


Tyr 


Leu 


Gin 


Glu 


Glu 


Pro 


Pro 


He 


Pro 


Val 


Val 


He 






195 










200 










O A C 

ZOb 








Cys 


Asp 


Gly 


Ser 


Gly Arg 


Ala 


Ser 


Asp 


He 


Leu 


Ser 


Phe 


Ala 


His 


Lys 




210 










215 










220 










Tyr 


Cys 


Glu 


Glu 


Gly 


Gly 


He 


He 


Asn 


Glu 


Ser 


Leu 


Arg 


Glu 


Gin 


Leu 


225 










230 










235 










240 


Leu 


Val 


Thr 


lie 


Gin 


Lys 


Thr 


Phe 


Asn 


Tyr 


Asn 


Lys 


Ala 


Gin 


Ser 


His 










245 










250 










255 




Gin 


Leu 


Phe 


Ala 


He 


He 


Met 


Glu 


Cys 


Met 


Lys 


Lys 


Lys 


Glu 


Leu 


Val 








260 










265 










270 






Thr 


Val 


Phe 


Arg 


Met 


Gly 


Ser 


Glu 


Gly 


Gin 


Gin 


Asp 


He 


Glu 


Met 


Ala 






275 










280 










285 








lie 


Leu 


Thr 


Ala 


Leu 


Leu 


Lys 


Gly 


Thr 


Asn 


Val 


Ser 


Ala 


Pro 


Asp 


Gin 




290 










295 










300 










Leu 


Ser 


Leu 


Ala 


Leu 


Ala 


Trp 


Asn 


Arg 


Val 


Asp 


He 


Ala 


Arg 


Ser 


Gin 


305 










310 










315 










320 


lie 


Phe 


Val 


Phe 


Gly 


Pro 


His 


Trp 


Thr 


Pro 


Leu 


Gly 


Ser 


Leu 


Ala 


Pro 










325 










330 










335 




Pro 


Thr 


Asp 


Ser 


Lys 


Ala 


Thr 


Glu 


Lys 


Glu 


Lys 


Lys 


Pro 


Pro 


Met 


Ala 








340 










34 5 










JoO 






Thr 


Thr 


Lys 


Gly Gly Arg 


Gly 


Lys 


Gly 


Lys 


Gly 


Lys 


Lys 


Lys 


Gly 


Lys 






355 










360 










ODD 








Val 


Lys 


Glu 


Glu 


Val 


Glu 


Glu 


Glu 


Thr 


Asp 


Pro 


Arg 


Lys 


He 


Glu 


Leu 




370 










375 










380 










Leu 


Asn 


Trp 


Val 


Asn 


Ala 


Leu 


Glu 


Gin 


Ala 


Met 


Leu 


Asp 


Ala 


Leu 


Val 


385 








390 










395 










400 


Leu 


Asp 


Arg 


Val 


Asp 


Phe 


Val 


Lys 


Leu 


Leu 


lie 


Glu 


Asn 


ijiy 


Val 


Asn 










405 










4 lU 










415 




Met 


Gin 


His 


Phe 


Leu 


Thr 


He 


Pro 


Arg 


Leu 


Glu 


Glu 


Leu 


Tyr 


Asn 


Thr 








420 










4 9 R 
















Arg 


Leu 


Gly 


Pro 


Pro 


Asn 


Thr 


Leu 


His 


Leu 


Leu 


Val 


Arg 


Asp 


Val 


Lys 
















440 










f* f* J 








Lys 


Ser 


Asn 


Leu 


Pro 


Pro 


Asp 


Tyr 


His 


T 1 « 

xxe 


Ser 


Leu 


T 1 c» 

± ±e 


Asp 


He 


Gly 




450 










455 










460 










Leu 


Val 


Leu 


Glu 


Tyr 


Leu 


Met 


Gly 


Gly 


Ala 


Tyr 


Arg 


Cys 


Asn 


Tyr 


Thr 


4 65 










470 










475 










480 


Arg 


Lys 


Asn 


Phe 


Arg 


Thr 


Leu 


Tyr 


Asn 


Asn 


Leu 


Phe 


Gly 


Pro 


Lys 


Arg 










485 










490 










495 




Pro 


Lys 


Ala 


Leu 


Lys 


Leu 


Leu 


Gly 


Met 


Glu 


Asp 


Asp 


Glu 


Pro 


Pro 


Ala 








500 










505 










510 






Lys 


Gly 


Lys 


Lys 


Lys 


Lys 


Lys 


Lys 


Lys 


Lys 


Glu 


Glu 


Glu 


He 


Asp 


He 






515 










520 










525 








Asp 


Val 


Asp 


Asp 


Pro 


Ala 


Val 


Ser 


Arg 


Phe 


Gin 


Tyr 


Pro 


Phe 


His 


Glu 




530 










535 










540 
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Leu 


Met 


Val 


Trp 


Ala 


Val 


Leu 


Met 


Lys 


Arg 


Gin 


Lys 


Met 


Ala 


Val 


Phe 


545 










550 










555 










560 


Leu 


Trp 


Gin 


Arg 


Gly 


Glu 


Glu 


Ser 


Met 


Ala 


Lys 


Ala 


Leu 


Val 


Ala 


Cys 










565 










570 










575 




Lys 


Leu 


Tyr 


Lys 


Ala 


Met 


Ala 


His 


Glu 


Ser 


Ser 


Glu 


Ser 


Asp 


Leu 


Val 








580 










585 










590 






Asp 


Asp 


He 


Ser 


Gin 


Asp 


Leu 


Asp 


Asn 


Asn 


Ser 


Lys 


Asp 


Phe 


Gly 


Gin 






595 










600 










605 








Leu 


Ala 


Leu 


Glu 


Leu 


Leu 


Asp 


Gin 


Ser 


Tyr 


Lys 


His 


Asp 


Glu 


Gin 


He 




610 










615 










620 










Ala 


Met 


Lys 


Leu 


Leu 


Thr 


Tyr 


Glu 


Leu 


Lys 


Asn 


Trp 


Ser 


Asn 


Ser 


Thr 


625 










630 










635 










640 


Cys 


Leu 


Lys 


Leu 


Ala 


Val 


Ala 


Ala 


Lys 


His 


Arg 


Asp 


Phe 


He 


Ala 


His 










645 










650 










655 




Thr 


Cys 


Ser 


Gin 


Met 


Leu 


Leu 


Thr 


Asp 


Met 


Trp 


Met 


Gly Arg 


Leu Arg 








660 










665 










670 






Met 


Arg 


Lys 


Asn 


Pro 


Gly 


Leu 


Lys 


Vai 


He 


Met 


Gly 


He 


Leu 


Leu 


Pro 






675 










680 










685 








Pro 


Thr 


He 


Leu 


Phe 


Leu 


Glu 


Phe 


Arg 


Thr 


Tyr 


Asp 


Asp 


Phe 


Ser 


Tyr 




690 










695 










700 










Gin 


Thr 


Ser 


Lys 


Glu 


Asn 


Glu 


Asp 


Gly Lys 


Glu 


Lys 


Glu 


Glu 


Glu 


Asn 


705 










710 










715 










720 


Thr 


Asp 


Ala 


Asn 


Ala 


Asp 


Ala 


Gly 


Ser Arg 


Lys 


Gly Asp 


Glu 


Glu 


Asn 










725 










730 










735 




Glu 


His 


Lys 


Lys 


Gin 


Arg 


Ser 


He 


Pro 


He 


Gly 


Thr 


Lys 


He 


Cys 


Glu 








740 










745 










750 






Phe 


Tyr 


Asn 


Ala 


Pro 


He 


Val 


Lys 


Phe 


Trp 


Phe 


Tyr 


Thr 


He 


Ser 


Tyr 






755 










760 










765 








Leu 


Gly 


Tyr 


Leu 


Leu 


Leu 


Phe 


Asn 


Tyr 


Val 


He 


Leu 


Val 


Arg 


Met 


Asp 




770 










775 










780 










Gly 


Trp 


Pro 


Ser 


Leu 


Gin 


Glu 


Trp 


He 


Val 


He 


Ser 


Tyr 


He 


Val 


Ser 


785 










790 










795 










800 


Leu 


Ala 


Leu 


Glu 


Lys 


He 


Arg 


Glu 


He 


Leu 


Met 


Ser 


Glu 


Pro 


Gly 


Lys 










805 










810 










815 




Leu 


Ser 


Gin 


Lys 


He 


Lys 


Val 


Trp 


Leu 


Gin 


Glu 


Tyr 


Trp 


Asn 


He 


Thr 








820 










825 










830 






Asp 


Leu 


Val 


Ala 


He 


Ser 


Thr 


Phe 


Met 


He 


Gly Ala 


He 


Leu 


Arg 


Leu 






835 










840 










845 








Gin 


Asn 


Gin 


Pro 


Tyr 


Met 


Gly 


Tyr 


Gly Arg 


Val 


He 


Tyr 


Cys 


Val 


Asp 




850 










855 










860 










He 


He 


Phe 


Trp 


Tyr 


He 


Arg 


Val 


Leu 


Asp 


He 


Phe 


Gly 


Val 


Asn 


Lys 


865 










870 










875 










880 


Tyr 


Leu 


Gly 


Pro 


Tyr 


Val 


Met 


Met 


He 


Gly 


Lys 


Met 


Met 


He 


Asp 


Met 










885 










890 










895 




Leu 


Tyr 


Phe 


Val 


Val 


He 


Met 


Leu 


Val 


Val 


Leu 


Met 


Ser 


Phe 


Gly 


Val 








900 










905 










910 






Ala 


Arg 


Gin 


Ala 


He 


Leu 


His 


Pro 


Glu 


Glu 


Lys 


Pro 


Ser 


Trp 


Lys 


Leu 






915 










920 










925 








Ala 


Arg 


Asn 


He 


Phe 


Tyr 


Met 


Pro 


Tyr 


Trp 


Met 


He 


Tyr 


Gly 


Glu 


Val 




930 










935 










940 










Phe 


Ala 


Asp 


Gin 


He 


Asp 


Leu 


Tyr 


Ala 


Met 


Glu 


He 


Asn 


Pro 


Pro 


Cys 


945 










950 










955 










960 


Gly 


Glu 


Asn 


Leu 


Tyr 


Asp 


Glu 


Glu 


Gly 


Lys 


Arg 


Leu 


Pro 


Pro 


Cys 


He 










965 










970 










975 




Pro 


Gly 


Ala 


Trp 


Leu 


Thr 


Pro 


Ala 


Leu 


Met 


Ala 


Cys 


Tyr 


Leu 


Leu 


Val 








980 










985 










990 






Ala 


Asn 


He 


Leu 


Leu 


Val 


Asn 


Leu 


Leu 


He 


Ala 


Val 


Phe 


Asn 


Asn 


Thr 






995 










1000 








1005 






Phe 


Phe 


Glu 


Val 


Lys 


Ser 


He 


Ser 


Asn 


Gin 


Val 


Trp 


Lys 


Phe 


Gin 


Arg 




1010 








1015 








1020 
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Tyr Gin Leu He Met Thr Phe His Asp Arg Pro Val Leu Pro Pro Pro 
1025 1030 1035 104 

Met He lie Leu Ser His He Tyr He He He Met Arg Leu Ser Gly 

1045 1050 1055 

Arg Cys Arg Lys Lys Arg Glu Gly Asp Gin Glu Glu Arg Asp Arg Gly 

1060 1065 1070 

Leu Lys Leu Phe Leu Ser Asp Glu Glu Leu Lys Arg Leu His Glu Phe 

1075 1080 1085 

Glu Glu Gin Cys Val Gin Glu His Phe Arg Glu Lys Glu Asp Glu Gin 

1090 1095 1100 

Gin Ser Ser Ser Asp Glu Arg He Arg Val Thr Ser Glu Arg Val Glu 
1105 1110 1115 112 

Asn Met Ser Met Arg Leu Glu Glu He Asn Glu Arg Glu Thr Phe Met 

1125 1130 1135 

Lys Thr Ser Leu Gin Thr Val Asp Leu Arg Leu Ala Gin Leu Glu Glu 

1140 1145 1150 

Leu Ser Asn Arg Met Val Asn Ala Leu Glu Asn Leu Ala Gly He Asp 

1155 1160 1165 

Arg Ser Asp Leu He Gin Ala Arg Ser Arg Ala Ser Ser Glu Cys Glu 

1170 1175 1180 

Ala Thr Tyr Leu Leu Arg Gin Ser Ser He Asn Ser Ala Asp Gly Tyr 
1185 1190 1195 120 

Ser Leu Tyr Arg Tyr His Phe Asn Gly Glu Glu Leu Leu Phe Glu Asp 

1205 1210 1215 

Thr Ser Leu Ser Thr Ser Pro Gly Thr Gly Val Arg Lys Lys Thr Cys 

1220 1225 1230 

Ser Phe Arg He Lys Glu Glu Lys Asp Val Lys Thr His Leu Val Pro 

1235 1240 1245 

Glu Cys Gin Asn Ser Leu His Leu Ser Leu Gly Thr Ser Thr Ser Ala 

1250 1255 1260 

Thr Pro Asp Gly Ser His Leu Ala Val Asp Asp Leu Lys Asn Ala Glu 
1265 1270 1275 128 

Glu Ser Lys Leu Gly Pro Asp He Gly He Ser Lys Glu Asp Asp Glu 

1285 1290 1295 

Arg Gin Thr Asp Ser Lys Lys Glu Glu Thr He Ser Pro Ser Leu Asn 

1300 1305 1310 

Lys Thr Asp Val He His Gly Gin Asp Lys Ser Asp Val Gin Asn Thr 

1315 1320 1325 

Gin Leu Thr Val Glu Thr Thr Asn He Glu Gly Thr He Ser Tyr Pro 

1330 1335 1340 

Leu Glu Glu Thr Lys He Thr Arg Tyr Phe Pro Asp Glu Thr He Asn 
1345 1350 1355 136 

Ala Cys Lys Thr Met Lys Ser Arg Ser Phe Val Tyr Ser Arg Gly Arg 

1365 1370 1375 

Lys Leu Val Gly Gly Val Asn Gin Asp Val Glu Tyr Ser Ser He Thr 

1380 1385 1390 

Asp Gin Gin Leu Thr Thr Glu Trp Gin Cys Gin Val Gin Lys He Thr 

1395 1400 1405 

Arg Ser His Ser Thr Asp He Pro Tyr He Val Ser Glu Ala Ala Val 

1410 1415 1420 

Gin Ala Glu Gin Lys Glu Gin Phe Ala Asp Met Gin Asp Glu His His 
1425 1430 1435 144 

Val Ala Glu Ala He Pro Arg He Pro Arg Leu Ser Leu Thr He Thr 

1445 1450 1455 

Asp Arg Asn Gly Met Glu Asn Leu Leu Ser Val Lys Pro Asp Gin Thr 

1460 1465 1470 

Leu Gly Phe Pro Ser Leu Arg Ser Lys Ser Leu His Gly His Pro Arg 

1475 1480 1485 

Asn Val Lys Ser He Gin Gly Lys Leu Asp Arg Ser Gly His Ala Ser 
1490 1495 1500 
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Ser Val Ser Ser Leu Val He Val Ser Gly Met Thr Ala Glu Glu Lys 
1505 1510 1515 152 

Lys Val Lys Lys Glu Lys Ala Ser Thr Glu Thr Glu Cys 
1525 1530 

<210> 11 

<211> 6220 

<212> DNA 

<213> Homo Sapiens 

<400> 11 

tgtgcagaat tgtacagttg cgaaaccatg tcgctggcag ctggtgctgg cggtggagac 60 

ttccctgtgc ggtgctcagt gcatctgcac ccgtggggga gggagctctt tctctggccc 120 

tgcagtcacc tgaggttgtt accattatga acggccgctg ggacccccgc atgtgcatgt 180 

actcccccag agtgtccggg ggccccagcc aagggacaca tctcacgcag ctgggaacat 240 

gtgcaggctg atgaagagaa ccggatgagg gcttcacatg aggaagcatg tggccaggtc 300 

ctctcagaac atcagcctca tcttcctgtc tctgatctat ttcaccaacc accccatgtg 360 

tctctagaac cccagtgtag cgagctggag agaggactgt cctgagggca gcaggcctgg 420 

ttgcagctgg cgtgggggtc tcagaatgga gccctcagcc ctgaggaaag ctggctcgga 480 

gcaggaggag ggctttgagg ggctgcccag aagggtcact gacctgggga tggtctccaa 540 

tctccggcgc agcaacagca gcctcttcaa gagctggagg ctacagtgcc ccttcggcaa 600 

caatgacaag caagaaagcc t:cagttcgtg gattcctgaa aacatcaaga agaaagaatg 660 

cgtgtatttt gtggaaagtt ccaaactgtc tgatgctggg aaggtggtgt gtcagtgtgg 720 

ctacacgcat gagcagcact tggaggaggc taccaagccc cacaccttcc agggcacaca 7 80 

gtgggaccca aagaaacatg tccaggagat gccaaccgat gcctttggcg acatcgtctt 84 0 

cacgggcctg agccagaagg tgaaaaagta cgtccgagtc tcccaggaca cgccctccag 900 

cgtgatctac cacctcatga cccagcactg ggggctggac gtccccaatc tcttgatctc 960 

ggtgaccggg ggggccaaga acttcaacat gaagccgcgg ctgaagagca ttttccgcag 1020 

aggcctggtc aaggtggctc agaccacagg ggcctggatc atcacagggg ggtcccacac 1080 

cggcgtcatg aagcaggtag gcgaggcggt gcgggacttc agcctgagca gcagctacaa 114 0 

ggaaggcgag ctcatcacca tcggagtcgc cacctggggc actgtccacc gccgcgaggg 1200 

cctgatccat cccacgggca gcttccccgc cgagtacata ctggatgagg atggccaagg 1260 

gaacctgacc tgcctagaca gcaaccactc tcacttcatc ctcgtggacg acgggaccca 1320 

cggccagtac ggggtggaga ttcctctgag gaccaggctg gagaagttca tatcggagca 1380 

gaccaaggaa agaggaggtg tggccatcaa gatccccatc gtgtgcgtgg tgctggaggg 1440 

cggcccgggc acgttgcaca ccatcgacaa cgccaccacc aacggcaccc cctgtgtggt 1500 

tgtggagggc tcgggccgcg tggccgacgt cattgcccag gtggccaacc tgcctgtctc 1560 

ggacatcact atctccctga tccagcagaa actgagcgtg ttcttccagg agatgtttga 1620 

gaccttcacg gaaagcagga ttgtcgagtg gaccaaaaag atccaagata ttgtccggag 1680 

gcggcagctg ctgactgtct tccgggaagg caaggatggt cagcaggacg tggatgtggc 174 0 

catcttgcag gccttgctga aagcctcacg gagccaagac cactttggcc acgagaactg 1800 

ggaccaccag ctgaaactgg cagtggcatg gaatcgcgtg gacattgccc gcagtgagat 1860 

cttcatggat gagtggcagt ggaagccttc agatctgcac cccacgatga cagctgcact 1920 

catctccaac aagcctgagt ttgtgaagct cttcctggaa aacggggtgc agctgaagga 1980 

gtttgtcacc tgggacacct tgctctacct gtacgagaac ctggacccct cctgcctgtt 204 0 

ccacagcaag ctgcaaaagg tgctggtgga ggatcccgag cgcccggctt gcgcgcccgc 2100 

ggcgccccgc ctgcagatgc accacgtggc ccaggtgctg cgggagctgc tgggggactt 2160 

cacgcagccg ctttatcccc ggccccggca caacgaccgg ctgcggctcc tgctgcccgt 2220 

tccccacgtc aagctcaacg tgcagggagt gagcctccgg tccctctaca agcgttcctc 2280 

aggccatgtg accttcacca tggaccccat ccgtgacctt ctcatttggg ccattgtcca 2340 

gaaccgtcgg gagctggcag gaatcatctg ggctcagagc caggactgca tcgcagcggc 2400 

cttggcctgc agcaagatcc tgaaggaact gtccaaggag gaggaggaca cggacagctc 24 60 

ggaggagatg ctggcgctgg cggaggagta tgagcacaga gccatcgggg tcttcaccga 2520 

gtgctaccgg aaggacgaag agagagccca gaaactgctc acccgcgtgt ccgaggcctg 2580 

ggggaagacc acctgcctgc agctcgccct ggaggccaag gacatgaagt ttgtgtctca 264 0 

cgggggcatc caggccttcc tgaccaaggt gtggtggggc cagctctccg tggacaatgg 2700 

gctgtggcgt gtgaccctgt gcatgctggc cttcccgctg ctcctcaccg gcctcatctc 2760 

cttcagggag aagaggctgc aggatgtggg cacccccgcg gcccgcgccc gtgccttctt 2820 

caccgcaccc gtggtggtct tccacctgaa catcctctcc tacttcgcct tcctctgcct 2880 

gttcgcctac gtgctcatgg tggacttcca gcctgtgccc tcctggtgcg agtgtgccat 2940 
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ctacctctgg ctcttctcct tggtgtgcga ggagatgcgg cagctcttct atgaccctga 3000 

cgagtgcggg ctgatgaaga aggcagcctt gtacttcagt gacttctgga ataagctgga 3060 

cgtcggcgca atcttgctct tcgtggcagg gctgacctgc aggctcatcc cggcgacgct 3120 

gtaccccggg cgcgtcatcc tctctctgga cttcatcctg ttctgcctcc ggctcatgca 3180 

catttttacc atcagtaaga cgctggggcc caagatcatc attgtgaagc ggatgatgaa 3240 

ggacgtcttc ttcttcctct tcctgctggc tgtgtgggtg gtgtccttcg gggtggccaa 3300 

gcaggccatc ctcatccaca acgagcgccg ggtggactgg ctgttccgag gggccgtcta 3360 

ccactcctac ctcaccatct tcgggcagat cccgggctac atcgacggtg tgaacttcaa 3420 

cccggagcac tgcagcccca atggcaccga cccctacaag cctaagtgcc ccgagagcga 34 80 

cgcgacgcag cagaggccgg ccttccctga gtggctgacg gtcctcctac tctgcctcta 3540 

cctgctcttc accaacatcc tgctgctcaa cctcctcatc gccatgttca actacacctt 3600 

ccagcaggtg caggagcaca cggaccagat ttggaagttc cagcgccatg acctgatcga 3660 

ggagtaccac ggccgccccg ccgcgccgcc ccccttcatc ctcctcagcc acctgcagct 3720 

cttcatcaag agggtggtcc tgaagactcc ggccaagagg cacaagcagc tcaagaacaa 3780 

gctggagaag aacgaggagg cggccctgct atcctgggag atctacctga aggagaacta 3840 

cctccagaac cgacagttcc agcaaaagca gcggcccgag cagaagatcg aggacatcag 3900 

caataaggtt gacgccatgg tggacctgct ggacctggac ccactgaaga ggtcgggctc 3960 

catggagcag aggttggcct ccctggagga gcaggtggcc cagacagccc gagccctgca 4 020 

ctggatcgtg aggacgctgc gggccagcgg cttcagctcg gaggcggacg tccccactct 4 080 

ggcctcccag aaggccgcgg aggagccgga tgctgagccg ggaggcagga agaagacgga 4140 

ggagccgggc gacagctacc acgtgaatgc ccggcacctc ctctacccca actgccctgt 4 200 

cacgcgcttc cccgtgccca acgagaaggt gccctgggag acggagttcc tgatctatga 4 260 

cccacccttt tacacggcag agaggaagga cgcggccgcc atggacccca tgggagacac 4 320 

cctggagcca ctgtccacga tccagtacaa cgtggtggat ggcctgaggg accgccggag 4 380 

cttccacggg ccgtacacag tgcaggccgg gttgcccctg aaccccatgg gccgcacagg 4 4 40 

actgcgtggg cgcgggagcc tcagctgctt cggacccaac cacacgctgt accccatggt 4 500 

cacgcggtgg aggcggaacg aggatggagc catctgcagg aagagcataa agaagatgct 4560 

ggaagtgctg gtggtgaagc tccctctctc cgagcactgg gccctgcctg ggggctcccg 4 620 

ggagccaggg gagatgctac ctcggaagct gaagcggatc ct:ccggcagg agcactggcc 4 680 

gtcttttgaa aacttgctga agtgcggcat ggaggtgtac aaaggctaca tggatgaccc 4740 

gaggaacacg gacaatgcct ggatcgagac ggtggccgtc agcgtccact tccaggacca 4800 

gaatgacgtg gagctgaaca ggctgaactc taacctgcac gcctgcgact cgggggcctc 4860 

catccgatgg caggtggtgg acaggcgcat cccactctat gcgaaccaca agaccctcct 4 920 

ccagaaggca gccgctgagt tcggggctca ctactgactg tgccctcagg ctgggcggct 4 980 

ccagtccata gacgttcccc ccagaaacca gggcttctct ctcctgagcc tggccaggac 504 0 

tcaggctgtt cctgggccct gcacatgatg gggtttggtg gacccagtgc ccctcacggc 5100 

tgccgcaagt ctgctgcaga tgacctcatg aactggaagg ggtcaaggtg acccgggagg 5160 

agagctcaag acagggcaca ggctactcag agctgagggg cccctgggac ccttggccat 5220 

caggcgaggg gctgggcctg tgcagctggg cccttggcca gagtccactc ccttcctggc 5280 

tgtgtcaccc cgagcagctc atccaccatg gaggtcattg gcctgaggca agttccccgg 5340 

agagtcggga tcccctgtgg ccccctcagg cctatgtctg tgaggaaggg gccctgccac 5400 

tctccccaag agggcctcca tgtttcgagg tgcctcaaca tggagccttg cctggcctgg 54 60 

gctaggggca ctgtctgaac tcctgactgt caggataaac tccgtggggg tacaggagcc 5520 

cagacaaagc ccaggcctgt caagagacgc agagggcccc tgccagggtt ggccccaggg 5580 

accctgggac gaggctgcag aagctctccc tccctactcc ctgggagcca cgtgctggcc 5 64 0 

atgtggccag ggacggcatg agcaggaggc ggggacgtgg gggccttctg gtttggtgtc 57 00 

aacagctcac aggagcgtga accatgaggg ccctcaggag gggaacgtgg taaaacccaa 57 60 

gacattaaat ctgccatctc aggcctggct ggctcttctg tgctttccac aaataaagtt 5820 

cctgacacgt ccagggccag gggctgtgtg acggctgcct gaagttctcc tcgatccccc 5880 

ggtgagcttc ctgcagcctg tggatgtcct gcagcccctc agccctaccc ccaagtttct 5940 

cctctgaccc atcagctccc tgtcttcatt ttcctaaacc tgggctccag catcgtcccc 6000 

aagcccacca ggccaggatg caggcatcca catgccctcc tccttggctt cccctgcgtg 6060 

gtggtgccaa tgtgccctgg cacccctgca gaggctccgg atggagcctg gggctgcctg 6120 

gccactgagc actggccgag gtgatgccca cccttccctg gacaggcctc tgtcttccac 6180 

ctgacccaaa gctctctagc cacccccttg tccccagtat 6220 
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<212> PRT 

<213> Homo Sapiens 





<400> 


12 


Met 


Glu 


Pro 


Ser 


1 








Phe 


Glu 


Gly 


Leu 








20 


Leu 


Arg 


Arg 


Ser 










Pro 


Phe 


Gly 


Asn 




50 






Glu 


Asn 


He 


Lys 


65 








Leu 


Ser 


Asp 


Ala 


Gin 


His 


Leu 


Glu 








100 


Trp 


Asp 


Pro 


Lys 






115 




Asp 


He 


Val 


Phe 




130 






Val 


Ser 


Gin 


Asp 


145 








His 


Trp 


Gly 


Leu 


Ala 


Lys 


Asn 


Phe 








180 


Gly 


Leu 


Val 


Lys 






195 




Gly 


Ser 


His 


Thr 




210 






Phe 


Ser 


Leu 


Ser 


225 








Val 


Ala 


Thr 


Trp 


Thr 


Gly 


Ser 


Phe 








260 


Asn 


Leu 


Thr 


Cys 






27 5 




Asp 


Gly 


Thr 


n J. o 




290 






Leu 


Glu 


Lys 


Phe 


305 








lie 


Lys 


He 


Pro 


Leu 


His 


Thr 


He 








340 


Val 


Glu 


Gly 


Ser 






355 




Leu 


Pro 


Val 


Ser 




370 






Val 


Phe 


Phe 


Gin 


385 








Glu 


Trp 


Thr 


Lys 


Thr 


Val 


Phe 


Arg 








420 


He 


Leu 


Gin 


Ala 






435 





Ala 


Leu 


Arg 


Lys 


5 








Pro 


Arg 


Arg 


Val 


Asn 


Ser 


Ser 


Leu 








40 


Asn 


Asp 


Lys 


Gin 










Lys 


Lys 


Glu 


Cys 




70 






Gly 


Lys 


Val 


Val 


85 








Glu 


Ala 


Thr 


Lys 


Lys 


His 


Val 


Gin 








120 


Thr 


Gly 


Leu 


Ser 






135 




Thr 


Pro 


Ser 


Ser 




150 






Asp 


Val 


Pro 


Asn 


165 








Asn 


Met 


Lys 


Pro 


Val 


Ala 


Gin 


Thr 








200 


Gly 


Val 


Met 


Lys 






^ ID 




Ser 


Ser 


Tyr 


Lys 




230 






Gly 


Thr 


Val 


His 


245 








Pro 


Ala 


Glu 


Tyr 


Leu 


Asp 


Ser 


Asn 








280 


Gly 


Gin 


Tyr 


Gly 






295 




He 


Ser 


Glu 


Gin 




310 






He 


Val 


Cys 


Val 


325 








Asp 


Asn 


Ala 


Thr 


Gly 


Arg 


Val 


Ala 








360 


Asp 


He 


Thr 


He 






375 




Glu 


Met 


Phe 


Glu 




390 






Lys 


He 


Gin 


Asp 


405 








Glu 


Gly 


Lys 


Asp 


Leu 


Leu 


Lys 


Ala 



Ala 


Gly Ser 


Glu 




10 






Thr 


Asp 


Leu 


Gly 


25 








Phe 


Lys 


Ser 


Trp 


Glu 


Ser 


Leu 


Ser 








60 


Val 


Tyr 


Phe 


Val 






75 




Cys 


Gin 


Cys 


Gly 




90 






Pro 


His 


Thr 


Phe 


105 








Glu 


Met 


Pro 


Thr 


Gin 


Lys 


Val 


Lys 








140 


Val 


He 


Tyr 


His 






155 




Leu 


Leu 


He 


Ser 




170 






Arg 


Leu 


Lys 


Ser 


185 








Thr 


Gly Ala 


Trp 


Gin 


Val 


Gly 


Glu 








220 


Glu 


Gly 


Glu 


Leu 






235 




Arg 


Arg 


Glu 


Gly 




250 






He 


Leu 


Asp 


Glu 


265 








His 


Ser 


His 


Phe 


Val 


Glu 


He 


Pro 








300 


Thr 


Lys 


Glu 


Arg 






315 




Val 


Leu 


Glu 


Gly 




330 






Thr 


Asn 


Gly 


Thr 


345 








Asp 


Val 


He 


Ala 


Ser 


Leu 


He 


Gin 








380 


Thr 


Phe 


Thr 


Glu 






395 




He 


Val Arg 


Arg 




410 






Gly Gin Gin 


Asp 


425 








Ser 


Arg 


Ser 


Gin 



Gin Glu Glu Gly 
15 

Met Val Ser Asn 
30 

Arg Leu Gin Cys 
45 

Ser Trp He Pro 

Glu Ser Ser Lys 
80 

Tyr Thr His Glu 
95 

Gin Gly Thr Gin 
110 

Asp Ala Phe Gly 
125 

Lys Tyr Val Arg 

Leu Met Thr Gin 
160 

Val Thr Gly Gly 
175 

He Phe Arg Arg 
190 

He He Thr Gly 
205 

Ala Val Arg Asp 

He Thr He Gly 
240 

Leu He His Pro 
255 

Asp Gly Gin Gly 
270 

He Leu Val Asp 
285 

Leu Arg Thr Arg 

Gly Gly Val Ala 
320 

Gly Pro Gly Thr 

335 

Pro Cys Val Val 
350 

Gin Val Ala Asn 
365 

Gin Lys Leu Ser 

Ser Arg He Val 
400 

Arg Gin Leu Leu 
415 

Val Asp Val Ala 
430 

Asp His Phe Gly 
445 
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His 


Glu 


Asn 


Trp 


Asp 


His 


Gin 


Leu 


Lys 


Leu 


Ala 


Val 


Ala 


Trp 


Asn 


Arg 




450 










455 










4 60 










Val 


Asp 


He 


Ala 


Arg 


Ser 


Glu 


lie 


Ptie 


Met 


Asp 


Glu 


Trp 


Gin 


Trp 


Lys 


465 










470 










475 










480 


Pro 


Ser 


Asp 


Leu 


His 


Pro 


Thr 


Met 


Thr 


Ala 


Ala 


Leu 


He 


Ser 


Asn 


Lys 










485 










4 90 










4 95 




Pro 


Glu 


Phe 


Val 


Lys 


Leu 


Phe 


Leu 


Glu 


Asn 


Gly Val 


Gin 


Leu 


Lys 


Glu 








500 










505 










510 






Phe 


Val 


Thr 


Trp 


Asp 


Thr 


Leu 


Leu 


Tyr 


Leu 


Tyr 


Glu 


Asn 


Leu 


Asp 


Pro 






515 










OZO 










525 








Ser 


Cys 


Leu 


Phe 


His 


Ser 


Lys 


Leu 


Gin 


Lys 


Val 


Leu 


Val 


Glu 


Asp 


Pro 




530 










535 










540 










Glu 


Arg 


Pro 


Ala 


Cys 


Ala 


Pro 


Ala 


Ala 


Pro 


Arg 


Leu 


Gin 


Met 


His 


His 


545 










550 










555 










560 


Val 


Ala 


Gin 


Val 


Leu 


Arg 


Glu 


Leu 


Leu 


Gly 


Asp 


Phe 


Thr 


Gin 


Pro 


Leu 










565 










570 










575 




Tyr 


Pro 


Arg 


Pro 


Arg 


His 


Asn 


Asp 


Arg 


Leu 


Arg 


Leu 


Leu 


Leu 


Pro 


Val 








580 










585 










590 






Pro 


His 


Val 


Lys 


Leu 


Asn 


Val 


Gin 


Gly 


Val 


Ser 


Leu 


Arg 


Ser 


Leu 


Tyr 






595 










600 










605 








Lys 


Arg 


Ser 


Ser 


Gly 


His 


Val 


Thr 


Phe 


Thr 


Met 


Asp 


Pro 


He 


Arg 


Asp 




610 










615 










620 










Leu 


Leu 


He 


Trp 


Ala 


He 


Val 


Gin 


Asn 


Arg 


Arg 


Glu 


Leu 


Ala 


Gly 


He 


625 










630 










635 










640 


lie 


Trp 


Ala 


Gin 


Ser 


Gin 


Asp 


Cys 


He 


Ala 


Ala 


Ala 


Leu 


Ala 


Cys 


Ser 










645 










650 










655 




Lys 


He 


Leu 


Lys 


Glu 


Leu 


Ser 


Lys 


Glu 


Glu 


Glu Asp 


Thr 


Asp 


Ser 


Ser 








660 










665 










670 






Glu 


Glu 


Met 


Leu 


Ala 


Leu 


Ala 


Glu 


Glu 


Tyr 


Glu 


His 


Arg 


Ala 


He 


Gly 






675 










680 










685 








Val 


Phe 


Thr 


Glu 


Cys 


Tyr 


Arg 


Lys 


Asp 


Glu 


Glu Arg Ala 


Gin 


Lys 


Leu 




690 










695 










700 










Leu 


Thr 


Arg 


Val 


Ser 


Glu 


Ala 


Trp 


Gly 


Lys 


Thr 


Thr 


Cys 


Leu 


Gin 


Leu 


705 










710 










715 










720 


Ala 


Leu 


Glu 


Ala 


Lys 


Asp 


Met 


Lys 


Phe 


Val 


Ser 


His 


Gly 


Gly 


He 


Gin 










725 










730 










735 




Ala 


Phe 


Leu 


Thr 


Lys 


Val 


Trp 


Trp 


Gly 


Gin 


Leu 


Ser 


Val 


Asp 


Asn 


Gly 








740 










.745 










750 






Leu 


Trp 


Arg 


Val 


Thr 


Leu 


Cys 


Met 


Leu 


Ala 


Phe 


Pro 


Leu 


Leu 


Leu 


Thr 






755 










7 60 










765 








Giy 


Leu 


He 


Ser 


Phe 


Arg 


Glu 


Lys 


Arg 


Leu 


Gin 


Asp 


Val 


Gly 


Thr 


Pro 




770 










775 










780 










Ala 


Ala 


Arg 


Ala 


Arg 


Ala 


Phe 


Phe 


Thr 


Ala 


Pro 


Val 


Val 


Val 


Phe 


His 


785 










790 










795 










800 


Leu 


Asn 


He 


Leu 


Ser 


Tyr 


Phe 


Ala 


Phe 


Leu 


Cys 


Leu 


Phe 


Ala 


Tyr 


Val 










805 










810 










815 




Leu 


Met 


Val 


Asp 


Phe 


Gin 


Pro 


Val 


Pro 


Ser 


Trp 


Cys 


Glu 


Cys 


Ala 


He 








820 










825 










830 






Tyr 


Leu 


Trp 


Leu 


Phe 


Ser 


Leu 


Val 


Cys 


Glu 


Glu 


Met 


Arg 


Gin 


Leu 


Phe 






835 










840 










845 








Tyr 


Asp 


Pro 


Asp 


Glu 


Cys 


Gly 


Leu 


Met 


Lys 


Lys 


Ala 


Ala 


Leu 


Tyr 


Phe 




850 










855 










860 










Ser 


Asp 


Phe 


Trp 


Asn 


Lys 


Leu 


Asp 


Val 


Gly 


Ala 


He 


Leu 


Leu 


Phe 


Val 


865 










870 










875 










880 


Ala 


Gly 


Leu 


Thr 


Cys 


Arg 


Leu 


He 


Pro 


Ala 


Thr 


Leu 


Tyr 


Pro 


Gly 


Arg 










885 










890 










895 




Val 


He 


Leu 


Ser 


Leu 


Asp 


Phe 


He 


Leu 


Phe 


Cys 


Leu 


Arg 


Leu 


Met 


His 








900 










905 










910 






He 


Phe 


Thr 


He 


Ser 


Lys 


Thr 


Leu 


Gly 


Pro 


Lys 


He 


He 


He 


Val 


Lys 






915 










920 










925 
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Arg 


Met 
930 


Met 


Lys 


Asp 


Val 


Phe 
935 


Phe 


Phe 


Leu 


Phe 


Leu 
940 


Leu 


Ala 


Val 


Trp 


Val 


Val 


Ser 


Phe 


Gly 


Val 


Ala 


Lys 


Gin 


Ala 


He 


Leu 


He 


His 


Asn 


Glu 


945 










950 










955 










960 


Arg 


Arg 


Val 


Asp 


Trp 
965 


Leu 


Phe 


Arg 


Gly 


Ala 
970 


Val 


Tyr 


His 


Ser 


Tyr 
975 


Leu 


Thr 


He 


Phe 


Gly 
980 


Gin 


He 


Pro 


Gly 


Tyr 
985 


He 


Asp 


Gly 


Val 


Asn 
990 


Phe 


Asn 


Pro 


Glu 


His 


Cys 


Ser 


Pro 


Asn 


Gly 


Thr 


Asp 


Pro 


Tyr 


Lys 


Pro 


Lys 


Cys 






995 










1000 








1005 






Pro 


Glu 


Ser 


Asp 


Ala 


Thr 


Gin 


Gin 


Arg 


Pro 


Ala 


Phe 


Pro 


Glu 


Trp 


Leu 




1010 








1015 








1020 








Thr 


Val 


Leu 


Leu 


Leu 


Cys 


Leu 


Tyr 


Leu 


Leu 


Phe 


Thr 


Asn 


He 


Leu 


Leu 


1025 








1030 








1035 








104 


Leu 


Asn 


Leu 


Leu 


He 


Ala 


Met 


Phe 


Asn 


Tyr 


Thr 


Phe 


Gin 


Gin 


Val 


Gin 










1045 








1050 








1055 


Glu 


His 


Thr 


Asp 


Gin 


He 


Trp 


Lys 


Phe 


Gin 


Arg 


His 


Asp 


Leu 


He 


Glu 








1060 








1065 








1070 




Glu 


Tyr 


His 


Gly Arg 


Pro 


Ala 


Ala 


Pro 


Pro 


Pro 


Phe 


He 


Leu 


Leu 


Ser 






1075 








1080 








1085 






His 


Leu 


Gin 


Leu 


Phe 


He 


Lys 


Arg 


Val 


Val 


Leu 


Lys 


Thr 


Pro 


Ala 


Lys 




1090 








1095 








1100 








Arg 


His 


Lys 


Gin 


Leu 


Lys 


Asn 


Lys 


Leu 


Glu 


Lys 


Asn 


Glu 


Glu 


Ala 


Ala 


1105 








1110 








1115 








112 


Leu 


Leu 


Ser 


Trp 


Glu 


He 


Tyr 


Leu 


Lys 


Glu 


Asn 


Tyr 


Leu 


Gin 


Asn 


Arg 










1125 








1130 








1135 


Gin 


Phe 


Gin 


Gin 


Lys 


Gin 


Arg 


Pro 


Glu 


Gin 


Lys 


He 


Glu 


Asp 


He 


Ser 








1140 








1145 








1150 




Asn 


Lys 


Val 


Asp 


Ala 


Met 


Val 


Asp 


Leu 


Leu 


Asp 


Leu 


Asp 


Pro 


Leu 


Lys 






1155 








1160 








1165 






Arg 


Ser 


Gly 


Ser 


Met 


Glu 


Gin 


Arg 


Leu 


Ala 


Ser 


Leu 


Glu 


Glu 


Gin 


Val 




1170 








1175 








1180 








Ala 


Gin 


Thr 


Ala 


Arg 


Ala 


Leu 


His 


Trp 


He 


Val 


Arg 


Thr 


Leu 


Arg 


Ala 


1185 








1190 








1195 








120 


Ser 


Gly 


Phe 


Ser 


Ser 


Glu 


Ala 


Asp 


Val 


Pro 


Thr 


Leu 


Ala 


Ser 


Gin 


Lys 










1205 








1210 








1215 


Ala 


Ala 


Glu 


Glu 


Pro 


Asp 


Ala 


Glu 


Pro 


Gly 


Gly Arg 


Lys 


Lys 


Thr 


Glu 








1220 








1225 








1230 




Glu 


Pro 


Gly 


Asp 


Ser 


Tyr 


His 


Val 


Asn 


Ala 


Arg 


His 


Leu 


Leu 


Tyr 


Pro 






1235 








1240 








1245 






Asn 


Cys 


Pro 


Val 


Thr 


Arg 


Phe 


Pro 


Val 


Pro 


Asn 


Glu 


Lys 


Val 


Pro 


Trp 




1250 








1255 








1260 








Glu 


Thr 


Glu 


Phe 


Leu 


He 


Tyr 


Asp 


Pro 


Pro 


Phe 


Tyr 


Thr 


Ala 


Glu 


Arg 


1265 








1270 








1275 








128 


Lys 


Asp 


Ala 


Ala 


Ala 


Met 


Asp 


Pro 


Met 


Gly 


Asp 


Thr 


Leu 


Glu 


Pro 


Leu 










1285 








1290 








1295 


Ser 


Thr 


He 


Gin 


Tyr 


Asn 


Val 


Val 


Asp 


Gly 


Leu 


Arg 


Asp 


Arg 


Arg 


Ser 








1300 








1305 








1310 




Phe 


His 


Gly 


Pro 


Tyr 


Thr 


Val 


Gin 


Ala 


Gly 


Leu 


Pro 


Leu 


Asn 


Pro 


Met 






1315 








1320 








1325 






Gly Arg 


Thr 


Gly Leu Arg 


Gly Arg 


Gly 


Ser 


Leu 


Ser 


Cys 


Phe 


Gly 


Pro 




1330 








1335 








1340 








Asn 


His 


Thr 


Leu 


Tyr 


Pro 


Met 


Val 


Thr 


Arg 


Trp Arg Arg 


Asn 


Glu 


Asp 


1345 








1350 








1355 








136 


Gly Ala 


He 


Cys 


Arg 


Lys 


Ser 


He 


Lys 


Lys 


Met 


Leu 


Glu 


Val 


Leu 


Val 










1365 








1370 








1375 


Val 


Lys 


Leu 


Pro 


Leu 


Ser 


Glu 


His 


Trp 


Ala 


Leu 


Pro Gly Gly 


Ser Arg 








1380 








1385 








1390 




Glu 


Pro 


Gly 


Glu 


Met 


Leu 


Pro Arg 


Lys 


Leu 


Lys 


Arg 


He 


Leu 


Arg 


Gin 



1395 1400 1405 
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Glu 


His 


Trp 


Pro 


Ser 


Phe 


Glu 


Asn 


Leu 


Leu 


Lys 


Cys 


Gly Met 


Glu 


Val 




1410 








1415 








1420 








Tyr 


Lys 


Gly 


Tyr 


Met 


Asp 


Asp 


Pro 


Arg 


Asn 


Thr 


Asp Asn 


Ala 


Trp 


He 


1425 








1430 








1435 








144 


Glu 


Thr 


Val 


Ala 


Val 


Ser 


Val 


His 


Phe 


Gin 


Asp 


Gin 


Asn 


Asp 


Val 


Glu 










1445 








1450 








1455 


Leu 


Asn 


Arg 


Leu 


Asn 


Ser 


Asn 


Leu 


His 


Ala 


Cys 


Asp 


Ser 


Gly Ala 


Ser 








1460 








1465 








1470 




He 


Arg 


Trp 


Gin 


Val 


Val 


Asp 


Arg 


Arg 


He 


Pro 


Leu 


Tyr 


Ala 


Asn 


His 






1475 








1480 








1485 






Lys 


Thr 


Leu 


Leu 


Gin 


Lys 


Ala 


Ala 


Ala 


Glu 


Phe 


Gly 


Ala 


His 


Tyr 






1490 








1495 








1500 










<210> 


13 




























<211> 


1816 


























<212> 


PRT 




























<213> 


C . Elegans 
























<400> 


13 


























Met 


He 


Thr 


Asp 


Lys 


Asn 


Leu 


Phe 


Ser 


Arg 


Leu 


Leu 




Lys 


Lys 


Asn 


1 


















10 










15 




Pro 


He 


Arg 


Met 


His 


Ser 


Pr-o 

trX. iJ 




Phe 


Ser 


Phe 


Ser 


Leu 


He 


Thr 


Ser 


















25 










30 






Leu 


Phe 


Phe 


Thr 


Gin 


Pne 






Phe 


Gin 


Leu 






Met 


Ala 


Tyr 






35 




















A R 








Phe 


Phe 


Leu 


Thr 


Leu 


±±e 


Ala 


Glv 


Val 


Thr 


His 


Phe 




Phe 


Pro 


Glu 




50 










DD 










an 

DV/ 










Lys 


Leu 


Leu 


o±y 


Lys 


Ser 


Glu 


A n 


Leu 


Asp 


His 


Arg 


xyxT 


Gin 


Ser 


Ser 


65 










r \J 










75 










80 


Glu 


Gin 


Lys 


"17-5 1 

vail. 


Leu 


J. j.e 


Glu 


Tim 


Thr 


Glu 


Asn 


ys 


Ala 


Val 


Ala 


Glu 










85 










90 










95 




Ser 


Leu 


Arg 


Ala 


Asn 


Ser 


Val 


Thr 


Val 


Glu 


Glu 


Asn 


Glu 


Ser 


Glu 


Arg 








100 










105 










110 






Glu 


Thr 


Glu 


Thr 


Gin 


Thr 


Lys 


Arg 


Arg 


Arg 


Lys 


Lys 


Gin 


Arg 


Ser 


Thr 






115 










120 










125 








Ser 


Ser 


Asp 


Lys 


Ala 


Pro 


Leu 


Asn 


Ser 


Ala 


Pro 


Arg 


His 


Val 


Gin 


Lys 




130 










135 










140 










Phe 


Asp 


Trp 


Lys 


Asp 


Met 


Leu 


His 


Leu 


Ala 


Asp 


He 


Ser 


Gly 


Arg 


Lys 


145 










150 










155 










160 


Arg 


Gly 


Asn 


Ser 


Thr 


Thr 


Ser 


His 


Ser 


Gly 


His 


Ala 


Thr 


Arg 


Ala 


Gly 










165 










170 










175 




Ser 


Leu 


Lys 


Gly 


Lys 


Asn 


Trp 


He 


Glu 


Cys 


Arg 


Leu 


Lys 


Met 


Arg 


Gin 








180 










185 










190 






Cys 


Ser 


Tyr 


Phe 


Val 


Pro 


Ser 


Gin 


Arg 


Phe 


Ser 


Glu 


Arg 


Cys 


Gly 


Cys 






195 










200 










205 








Gly 


Lys 


Glu 


Arg 


Ser 


Lys 


His 


Thr 


Glu 


Glu 


Val 


Leu 


Glu 


Arg 


Ser 


Gin 




210 










215 










220 










Asn 


Lys 


Asn 


His 


Pro 


Leu 


Asn 


His 


Leu 


Thr 


Leu 


Pro 


Gly 


He 


His 


Glu 


225 










230 










235 










240 


Val 


Asp 


Thr 


Thr 


Asp 


Ala 


Asp 


Ala 


Asp Asp 


Asn 


Glu 


Val 


Asn 


Leu 


Thr 










245 










250 










255 




Pro 


Gly 


Arg 


Trp 


Ser 


He 


Gin 


Ser 


His 


Thr 


Glu 


He 


Val 


Pro 


Thr 


Asp 








260 










265 










270 






Ala 


Tyr 


Gly 


Asn 


He 


Val 


Phe 


Glu 


Gly 


Thr 


Ala 


His 


His 


Ala 


Gin 


Tyr 






275 










260 










285 








Ala 


Arg 


He 


Ser 


Phe 


Asp 


Ser 


Asp 


Pro Arg 


Asp 


He 


Val 


His 


Leu 


Met 




290 










295 










300 










Met 


Lys 


Val 


Trp 


Lys 


Leu 


Lys 


Pro 


Pro 


Lys 


Leu 


He 


He 


Thr 


He 


Asn 


305 










310 










315 










320 


Gly 


Gly 


Leu 


Thr 


Lys 


Phe 


Asp 


Leu 


Gin 


Pro 


Lys 


Leu 


Ala 


Arg 


Thr 


Phe 
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325 



Arg 


Lys 


Gly 


He 


Met 


Lys 


He 


Ala 








340 










Thr 


Ser 


Gly 


Leu 


Asp 


Glu 


Gly 


Val 






355 










360 


His 


Ala 


Leu 


Glu 


Phe 


Trp 


Ser 


Phe 




370 










375 




Asp 


Val 


Leu 


Leu 


Ala 


His 


Ser 


Met 


385 










390 






Asp 


His 


Gly 


Asn 


His 


Thr 


Ser 


Lys 










405 








Ala 


Ser 


Trp 


Gly 


Met 


Leu 


Lys 


Gin 








420 










Ser 


Thr 


Val 


Thr 


Tyr 


Ala 


Thr 


Asn 






435 










440 


Glu 


Leu 


Asn 


Asp 


Asn 


His 


Ser 


Tyr 




450 










455 




Val 


Asn 


Arg 


Tyr 


Gly 


Ala 


Glu 


He 


465 










470 






Tyr 


Leu 


Ala 


Gin 


Gly 


Asp 


Lys 


Lys 










485 








Val 


Val 


Leu 


Glu 


Gly 


Gly 


Ala 


Phe 








500 










Val 


Thr 


Thr 


He 


Pro 


Arg 


He 


Pro 






515 










520 


Arg 


Ala 


Ala 


Asp 


He 


Leu 


Ala 


Phe 




530 










535 




Gly 


Phe 


Leu 


Ser 


Asp 


Asn 


He 


Arg 


545 










550 






Arg 


He 


Phe 


Gly Tyr 


Asp 


Pro 


Lys 










565 








He 


Val 


Glu 


Cys 


Ser 


Thr 


Asn 


Lys 








580 










Gly 


Glu 


Ser 


Ser 


Arg 


Glu 


Asp 


Leu 






595 










600 


Leu 


Lys 


Gly 


Gin 


Asn 


Leu 


Ser 


Pro 




610 










615 




Ala 


Trp 


Asn 


Arg 


Ala 


Asp 


He 


Ala 


625 










630 






Thr 


Glu 


Trp 


Thr 


Thr 


Gin 


Asp 


Leu 










645 








Ser 


Asn 


Asp 


Arg 


He 


Asp 


Phe 


Val 








660 










Ser 


Met 


Gin 


Lys 


Phe 


Leu 


Thr 


Tyr 






675 










680 


Thr 


Asp 


Lys 


Gly 


Pro 


Gin 


Asn 


Thr 




690 










695 




Ser 


Lys 


His 


His 


He 


Lys 


Leu 


Val 


705 










710 






Leu 


Met 


Gly 


Asn 


Leu 


Tyr 


Lys 


Ser 










725 








Asn 


Gin 


Tyr 


Phe 


Leu 


Phe 


Asn 


Asn 








740 










His 


Ser 


Asn 


Ser 


Asn 


Gly 


Gly 


Arg 






755 










760 


Asp 


Ala 


Gly 


Arg 


Glu 


Arg 


Met 


Ser 




770 










775 




Asn 


Ala 


Arg 


Asn 


Ser 


He 


He 


Ser 


785 










790 






Arg 


Glu 


Ser 


Asp Asp 


Glu 


Asp 


Asp 





-IQ 
















330 










335 




Lys 


Ser 


Thr 


Asp 


Ala 


Trp 


He 


He 


34 5 










350 






Val 


Lys 


His 


Leu 


Asp 


Ser 


Ala 


Leu 










365 








Gly 


Leu 


Phe 


Trp 


Val 


He 


Gin 


Leu 








380 










Phe 


He 


Pro 


Arg 


Gly 


Ser 


Leu 


Phe 






395 










400 


Asn 


His 


Val 


Val 


Ala 


He 


Gly 


He 




410 










415 




Arg 


Ser 


Arg 


Phe 


Val 


Gly 


Lys 


Asp 


425 










4 30 






Val 


Phe 


Asn 


Asn 


Thr 


Arg 


Leu 


Lys 










445 








Phe 


Leu 


Phe 


Ser Asp Asn Gly 


Thr 








4 60 










He 


Met 


Arg 


Lys 


Arg 


Leu 


Glu 


Ala 






475 










480 


Arg 


Ser 


Ala 


He 


Pro 


Leu 


Val 


Cys 




490 










495 




Thr 


He 


Lys 


Met 


Val 


His 


Asp 


Tyr 


505 










510 






Val 


He 


Val 


Cys 


Asp 


Gly 


Ser 


Gly 










525 








Ala 


His 


Gin 


Ala 


Val 


Ser 


Gin 


Asn 








540 










Asn 


Gin 


Leu 


Val 


Asn 


He 


Val 


Arg 






555 










560 


Thr 


Ala 


Gin 


Lys 


Leu 


He 


Lys 


Gin 




570 










575 




Ser 


Leu 


Met 


Thr 


He 


Phe 


Arg 


Leu 


585 










590 






Asp 


His 


Val 


He 


Met 


Ser 


Cys 


Leu 










605 








Pro 


Glu 


Gin 


Leu 


Gin 


Leu 


Ala 


Leu 








620 










Arg 


Thr 


Glu 


He 


Phe 


Ala 


Asn 


Gly 






635 










640 


His 


Asn 


Ala 


Met 


He 


Glu 


Ala 


Leu 




650 










655 




His 


Leu 


Leu 


Leu 


Glu 


Asn 


Gly 


Val 


665 










670 






Gly 


Arg 


Leu 


Glu 


His 


Leu 


Tyr 


Asn 










685 








Leu 


Arg 


Thr 


Asn 


Leu 


Leu 


Val 


Asp 








700 










Glu 


Val 


Gly Arg 


Leu 


Val 


Glu 


Asn 






715 










720 


Asn 


Tyr 


Thr 


Lys 


Glu 


Glu 


Phe 


Lys 




730 










735 




Arg 


Lys 


Gin Phe Gly 


Lys 


Arg 


Val 


745 










750 






Asn 


Asp 


Val 


He 


Gly 


Pro 


Ser 


Gly 










765 








Ser 


Met 


Gin 


He 


Ser 


Leu 


He 


Asn 








780 










Leu 


Phe 


Asn Gly Gly Gly Arg 


Lys 






795 










800 


Phe 


Ser 


Asn 


Leu 


Glu 


Glu 


Glu 


Ala 
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805 










810 










815 




Asn 


Met 


Asp 


Phe 


Thr 


Phe 


Arg 


Tyr 


Pro 


Tyr 


Ser 


Asp 


Leu 


Met 


He 


Trp 








820 










825 










830 






Ala 


Val 


Leu 


Thr 


Lys 


Arg 


Gin 


Lys 


Met 


Ala 


Lys 


Leu 


Met 


Trp 


Thr 


His 






835 










840 










845 








Gly 


Glu 


Glii 


Gly 


Met 


Ala 


Lys 


Ala 


Leu 


Val 


Ala 


Ser 


Arg 


Leu 


Tyr 


Val 




850 










855 










860 










Ser 


Leu 


Ala 


Lys 


Thr 


Ala 


Ser 


Leu 


Ala 


Thr 


Gly 


Glu 


He 


Gly 


Met 


Ser 


865 










870 










875 










880 


Gin 


Asp 


Phe 


Thr 


Glu 


Phe 


Ser 


Asp 


Glu 


Phe 


Ser 


Glu 


Leu 


Ala 


Val 


Glu 










885 










890 










895 




Val 


Leu 


Glu 


Tyr 


Cys 


Thr 


Lys 


His 


Gly 


Arg 


Asp 


Gin 


Thr 


Leu 


Arg 


Leu 








900 










905 










910 






Leu 


Thr 


Cys 


Glu 


Leu 


Ala 


Asn 


Trp 


Gly 


Asp 


Glu 


Thr 


Cys 


Leu 


Ser 


Leu 






915 










920 










925 








Ala 


Ala 


Asn 


Asn 


Gly 


His 


Arg 


Lys 


Phe 


Leu 


Ala 


His 


Pro 


Cys 


Cys 


Gin 




930 










935 










940 










Met 


Leu 


Leu 


Ser 


Asp 


Leu 


Trp 


Gin 


Gly 


Gly Leu 


Leu 


Met 


Lys 


Asn 


Asn 


945 










950 










955 










960 


Gin 


Asn 


Ser 


Lys 


Val 


Leu 


Thr 


Cys 


Leu 


Ala 


Ala 


Pro 


Pro 


Leu 


He 


Phe 










965 










970 










975 




Leu 


Leu 


Gly 


Phe 


Lys 


Thr 


Lys 


Glu 


Gin 


Leu 


Met 


Leu 


Gin 


Pro 


Lys 


Thr 








980 










985 










990 






Ala 


Ala 


Glu 


His 


Asp 


Glu 


Glu 


Met 


Ser 


Asp 


Ser 


Glu 


Met 


Asn 


Ser 


Ala 






995 










1000 








1005 






Glu 


Asp 


Thr Asp 


Thr 


Ser 


Ser 


A.sp 


Ser 


Ser 


Ser 


Asp 


Ser 


Asp 


Asp 


Ser 




1010 








1015 








1020 








Asp 


Glu 


Glu 


Asp 


Ala 


Lys 


Leu 


Arg 


Ala 


Gin 


Ser 


Leu 


Ser 


Ala 


Asp 


Gin 


1025 








1030 








1035 








104 


Pro 


Leu 


Ser 


He 


His 


Arg 


Leu 


Val 


Arg 


Asp 


Lys 


Leu 


Asn 


Phe 


Ser 


Glu 










1045 








1050 








1055 


Lys 


Lys 


Lys 


Pro 


Asp 


Met 


Gly 


He 


Ser 


Arg 


He 


Val 


Val 


Ala 


Pro 


Pro 








1060 








1065 








1070 




He 


Val 


Thr 


Gly Arg 


Asn 


Arg 


Ala 


Arg 


Thr 


Met 


Ser 


He 


Lys 


Lys 


Ser 






1075 








1080 








1085 






Lys 


Lys 


Asn 


Val 


He 


Lys 


Pro 


Pro 


Ala 


Cys 


Leu 


Lys 


He 


Glu 


Thr 


Ser 



1090 1095 1100 



Asp Asp Asp Glu Gin Glu Gin Lys Lys Ala Thr Glu Met Cys Lys Ser 
1105 1110 1115 112 

Thr Phe Phe Asp Phe Phe Phe Asp Phe Pro Tyr He Asn Arg Thr Gly 

1125 1130 1135 

Lys Arg Gly Ser Val Ala Val Ala Met Asn His Asp Asp Met Tyr He 

1140 1145 1150 

Asp Pro Ser Glu Glu Leu Asp Thr Gin Thr Arg Gin Lys Ser Ser Arg 

1155 1160 1165 

Glu Phe Ser Ser Ser Arg Asn Val Thr Val Gin Val Tyr Thr Gin Arg 

1170 1175 1180 

Pro Leu Ser Trp Lys Lys Lys He Met Glu Phe Tyr Lys Ala Pro He 
1185 1190 1195 120 

Thr Thr Tyr Trp Leu Trp Phe Phe Ala Phe He Trp Phe Leu He Leu 

1205 1210 1215 

Leu Thr Tyr Asn Leu Leu Val Lys Thr Gin Arg He Ala Ser Trp Ser 

1220 1225 1230 

Glu Trp Tyr Val Phe Ala Tyr He Phe Val Trp Thr Leu Glu He Gly 

1235 1240 ^ 1245 

Arg Lys Val Val Ser Thr He Met Met Asp Thr Ser Lys Pro Val Leu 

1250 1255 1260 

Lys Gin Leu Arg Val Phe Phe Phe Gin Tyr Arg Asn Gly Leu Leu Ala 
1265 1270 1275 128 

Phe Gly Leu Leu Thr Tyr Leu He Ala Tyr Phe He Arg Leu Ser Pro 
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1285 1290 1295 

Thr Thr Lys Thr Leu Gly Arg lie Leu lie lie Cys Asn Ser Val lie 

1300 1305 1310 

Trp Ser Leu Lys Leu Val Asp Tyr Leu Ser Val Gin Gin Gly Leu Gly 

1315 1320 1325 

Pro Tyr lie Asn lie Val Ala Glu Met lie Pro Thr Met lie Pro Leu 

1330 1335 1340 

Cys Val Leu Val Phe lie Thr Leu Tyr Ala Phe Gly Leu Leu Arg Gin 
1345 1350 1355 136 

Ser lie Thr Tyr Pro Tyr Glu Asp Trp His Trp He Leu Val Arg Asn 

1365 1370 1375 

He Phe Leu Gin Pro Tyr Phe Met Leu Tyr Gly Glu Val Tyr Ala Ala 

1380 1385 1390 

Glu He Asp Thr Cys Gly Asp Glu He Trp Gin Thr His Glu Asp Glu 

1395 1400 1405 

Asn He Pro He Ser Met Leu Asn Val Thr His Glu Thr Cys Val Pro 

1410 1415 1420 

Gly Tyr Trp He Ala Pro Val Gly Leu Thr Val Phe Met Leu Ala Thr 
1425 1430 1435 144 

Asn Val Leu Leu Met Asn Val Met Val Ala Gly Cys Thr Tyr He Phe 

1445 1450 1455 

Glu Lys His He Gin Ser Thr Arg Glu He Phe Leu Phe Glu Arg Tyr 

1460 1-465 1470 

Gly Gin Val Met Glu Tyr Glu Ser Thr Pro Trp Leu Pro Pro Pro Phe 

1475 1480 1485 

Thr He He Tyr His Val He Trp Leu Phe Lys Leu He Lys Ser Ser 

1490 1495 1500 

Ser Arg Met Phe Glu Arg Lys Asn Leu Phe Asp Gin Ser Leu Lys Leu 
1505 1510 1515 152 

Phe Leu Ser Pro Asp Glu Met Glu Lys Val His Thr Phe Glu Glu Glu 

1525 1530 1535 

Ser Val Glu Asp Met Lys Arg Glu Thr Glu Lys Lys Asn Leu Ser Ser 

1540 1545 1550 

Asn Asp Glu Arg He His Arg Thr Ala Glu Arg Thr Asp Ala He Leu 

1555 1560 1565 

Asn Arg Val Ser His Leu Thr Gin Leu Glu Phe Thr Leu Lys Glu Glu 

1570 1575 1580 

He Arg Glu Leu Glu His Lys Met Lys Asn Met Asp Ser Arg His Lys 
1585 1590 1595 160 

Glu Gin Met Asn Leu Met Leu Asp Met Asn Lys Lys Leu Gly Lys Phe 

1605 1610 1615 

He Ser Gly Lys Tyr Lys Arg Gly Ser Phe Gly Gly Ser Gly Ser Asp 

1620 1625 1630 

Gly Gly Gly Gly Ser Ser Asp Asn Ser Lys Leu Glu Pro Asn Asn Ser 

1635 1640 1645 

Val Pro Met He Thr Val Asp Gly Pro Ser Pro He Gly Ser Arg Arg 

1650 1655 1660 
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Trp Gly Leu 


Leu 


Lys 


Lys 


Arg 


Glu 


Asp 










325 










330 










335 




Phe 


He 


Gly 


Gin 


Asp 


Lys 


Thr 


Val 


Pro 


Tyr 


Tyr 


Pro 


Ser 


Ser 


Ser 


Lys 








340 










345 










350 






Gly Arg 


Phe 


Thr 


Gly 


Leu 


Asn 


Asn Arg 


His 


Ser 


Tyr 


Phe 


Leu 


Leu 


Val 
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360 


Asp 


Asn 


oxy 


1 nx 


VaX 


laX y 


Arg 


Tyr 




J / u 
















Leo 


oXU 


net. 


Tyr 


T 1 a 

xxe 




uXf 1 




















oer 


\7a 1 

va± 


xrro 


vax 


vax 


Cys 


Vox 










^ U 3 








T 1 o 




Car- 
OCX 


V aX 


Leu 


Asp 


Tyr 


VciX 


















Vo J, 


Va± 


Cys 


Asp 


laX y 


oex 


oX y 


Arg 






4 JO 










4 4 n 


flJLS 




Asn 


vax 


1 nr 


GXU 


Asp 


Gx y 




D U 










*i J z> 






vax 


Leu 


Leu 


Leu 


vax 


GXU 


1 nr 


A 
















A 1 

A±a 


His 


Arg 


Leu 


Leu 


His 


GIU 


Leu 










4 o D 








Lgu 


Leu 


A nr 


T 1 c» 

xxe 


r) v» 

irne 


Arg 


Leu 


ox y 


















His 


Ala 


He 


Leu 


Thr 


Ala 


Leu 


Leu 
















u 


Asp 




Leu 


Axa 


Leu 


AX a 


Leu 


Ala 




D JU 










R "5 




Ser 


Asp 


Val 


Pne 


Ala 


Met 


Gly 


His 












e; ^ n 






Asn 


Ala 


Met 


Met 


Glu 


Ala 


Leu 


He 










c; ^ c: 








Leu 


Leu 


Leu 


GXU 


Gin 


Gly 


T T « 

lie 


Asn 








c; Q A 










Arg 


Leu 


Asp 


Glu 


Leu 


Tyr 


Asn 


Thr 
















DU u 




Tyr 


xxe 


vai 


Arg 


Asp 


vai 


Val 




Di.U 










ox 3 




Lys 


Leu 


Pro 


Asp 


x±e 


Gly 


Leu 


vai 


625 










£')n 






Tyr 


Gin 


Cys 


oer 


Tyr 


1 nr 


1 nr 


Ser 










64 5 








Arg 


Met 


i^ys 


Arg 


vax 


iiys 


nxS 


AX a 








660 










Ser 


Ser 


Arg 


fro 


oer 


Arg 


J. nr 


vjxy 






o / ^ 










n 

oo u 


Thr 


Glu 


uxy 




oxy 


Gxy 


vax 


Gxy 




690 










695 




Phe 


Gly 


Asn 


Ser 


PHa 
Xrlie 


v3xy 


Asn 


ftl n 
uXIl 


705 










710 






Asn Arg 


Ser 


AX a 


Leu 


Ser 


oxy 


Ser 


















Trp Arg 


OcX 


AT A 
nXct 




Arg 


vsxy 


Asn 








740 










Pro 


Asn 


Leu 


Gly 


Asp 


Ser 


Arg 


Asp 






755 










760 


Leu 


Ser 


Leu 


Thr 


Ser 


Ala 


Ser 


Asp 




770 










775 




Arg 


Tyr 


Pro 


Tyr 


Ser 


Glu 


Leu 


Met 


785 










790 






Gin 


Asp 


Met 


Ala 


Met 


Cys 


Met 


Trp 










805 








Lys 


Ala 


Leu 


Val 


Ala 


Cys 


Arg 


Leu 








820 










Ala 


Glu 


Asp 


Tyr 


Leu 


Glu 


Val 


Glu 



-26- 











365 








Gly Ala 


uXu 


Val 


He 


Leu 


Arg 


Lys 








380 










Lys 


Gin 


Lys 


He 


Phe 


ux y 


GX y 


TKr 
X fix 






O -7 3 










Ana 


vai 


Leu 


uXU 


Gly Gly 


Ser 


Cys 


TKv- 

X nr 


















1 nr 


Asn 


VaX 


Pro 


Arg 


vai 


Pro 


VclX 


















Aia 


Aia 


Asp 


Leu 


Leu 


AX a 


DKa 

fne 


Axa 










4 4 5 








Leu 


Leu 


Pro 


Asp 


Asp 


T 1 a 

xxe 


Arg 


Arg 








4 60 










Thr 


Pne 


ux y 


Cys 


Ser 


GIU 


Ai a 


AX a 






H f D 










4 ft n 

ft o u 


Thr 


Val 


Cys 


Ala 


Gin 


His 


Lys 


Asn 














4 y D 




Glu 


Gin 


G J. y 


Glu 


His 


Asp 


vai 


Asp 


R n CI 










D 1 U 






Lys 


Gly 


Gin 


Asn 


Leu 


Ser 


Ala 


Ala 










525 








Trp 


Asn 


Arg 


Val 


Asp 


He 


Ala 


Arg 








54 0 










Glu 


Trp 


Pro 


Gin 


Ala 


Ala 


Leu 


His 






CI CI c: 
ODD 










D DU 


His 


Asp 


Arg 


Val 


Asp 


Phe 


Val 


Arg 




570 










tn CI 
D / D 




Met 


Gin 


Lys 


Phe 


Leu 


Thr 


He 


Ser 


585 










D yu 






Asp 


Lys 


Gly 


Pro 


Pro 


Asn 


Thr 


lieu 










605 








Arg 


Val 


Arg 


Gin 


Gly 


Tyr 


Arg 


OK^ 

irne 








620 










He 


Glu 


Lys 


Leu 


Met 


Gly 


Asn 


Ser 
















09 u 


Glu 


Phe 


Arg 


Asp 


Lys 


Tyr 


Lys 


Gin 




650 










ODD 




Gin 


Lys 


Lys 


Ala 


Met 


Giy 


vai 


PKa 


665 










670 






Ser 


Gly 


T 1 a 

lie 


Ala 


Ser 


Arg 


Gin 


oer 










685 








Gly Gly 


Qa T- 


Ser 


Val 


AXo 


oxy 


VclX 








700 










Asp 


Pro 


Pr-rk 
irx O 


Leu Asp 


Pro 


JIX 5 


VclX 






71 S 

f XZJ 










720 


Arg Ala 


Leu 


Ser 


Asn 


nxa 


T 1 a 
xxq 


Leu 




730 










7 JZJ 




Phe 


Pro 


Ala 


Asn 


Pro 


Met 


Arg 


Pro 


745 










750 






Cys 


Gly 


Ser 


Glu 


Phe 


Asp 


Glu 


Glu 










765 








Gly Ser 


Gin 


Thr 


Glu 


Pro 


Asp 


Phe 








780 










He 


Trp 


Ala 


Val 


Leu 


Thr 


Lys 


Arg 






795 










800 


Gin 


His 


Gly 


Glu 


Glu 


Ala 


Met 


Ala 




810 










815 




Tyr 


Lys 


Ser 


Leu 


Ala 


Thr 


Glu 


Ala 


825 










830 






He 


Cys 


Glu 


Glu 


Leu 


Lys 


Lys 


Tyr 
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835 










84C 










845 








Ala 


Glu 
850 


Glu 


Phe 


Arg 


He 


Leu 
855 


Ser 


Leu 


Glu 


Leu 


Leu 
860 


Asp 


His 


Cys 


Tyr 


His 


Val 


Asp 


Asp 


Ala 


Gin 


Thr 


Leu 


Gin 


Leu 


Leu 


Thr 


Tyr 


Glu 


Leu 


Ser 


865 










870 










875 










880 


Asn 


Trp 


Ser 


Asn 


Glu 
885 


Thr 


Cys 


Leu 


Ala 


Leu 
890 


Ala 


Val 


He 


Val 


Asn 
895 


Asn 


Lys 


His 


Phe 


Leu 
900 


Ala 


His 


Pro 


Cys 


Cys 
905 


Gin 


He 


Leu 


Leu 


Ala 
910 


Asp 


Leu 


Trp 


His 


Gly 
915 


Gly 


Leu 


Arg 


Met 


Arg 
920 


Thr 


His 


Ser 


Asn 


He 
925 


Lys 


Val 


Val 


Leu 


Gly 
930 


Leu 


He 


Cys 


Pro 


Pro 
935 


Phe 


He 


Gin 


Met 


Leu 
940 


Glu 


Phe 


Lys 


Thr 


Arg 


Glu 


Glu 


Leu 


Leu 


Asn 


Gin 


Pro 


Gin 


Thr 


Ala 


Ala 


Glu 


His 


Gin 


Asn 


945 










950 










955 










960 


Asp 


Met 


Asn 


Tyr 


Ser 
965 


Ser 


Ser 


Ser 


Ser 


Ser 
970 


Ser 


Ser 


Ser 


Ser 


Ser 
975 


Ser 


Ser 


Ser 


Ser 


Ser 
980 


Ser 


Asp 


Ser 


Ser 


Ser 
985 


Phe 


Glu 


Asp 


Asp 


Asp 
990 


Asp 


Glu 


Asn 


Asn 


Ala 


His 


Asn 


His 


Asp 


Gin 


Lys 


Arg 


Thr Arg 


Lys 


Thr 


Ser 


Gin 






995 










1000 








1005 






Gly 


Ser 


Ala 


Gin 


Ser 


Leu 


Asn 


He 


Thr 


Ser 


Leu 


Phe 


His 


Ser 


Arg 


Arg 




1010 








1015 








1020 








Arg 


Lys 


Ala 


Lys 


Lys 


Asn 


Glu 


Lys 


Cys 


Asp 


Arg 


Glu 


Thr 


Asp 


Ala 


Ser 


1025 








1030 








1035 








104 


Ala 


Cys 


Glu 


Ala 


Gly 


Asn 


Arg 


Gin 


He 


Gin 


Asn 


Gly 


Gly 


Leu 


Thr 


Ala 










1045 








1050 








1055 


Glu 


Tyr 


Gly 


Thr 


Phe 


Gly 


Glu 


Ser 


Asn 


Gly 


Val 


Ser 


Pro 


Pro 


Pro 


Pro 








1060 








1065 








1070 




Tyr 


Met 


Arg 


Ala 


Asn 


Ser 


Arg 


Ser 


Arg 


Tyr 


Asn 


Asn 


Arg 


Ser 


Asp 


Met 






1075 








1080 








1085 






Ser 


Lys 


Thr 


Ser 


Ser 


Val 


He 


Phe 


Gly 


Ser 


Asp 


Pro 


Asn 


Leu 


Ser 


Lys 




1090 








1095 








1100 








Leu 


Gin 


Lys 


Ser 


Asn 


He 


Thr 


Ser 


Thr 


Asp 


Arg 


Pro 


Asn 


Pro 


Met 


Glu 


1105 








1110 








1115 








112 


Gin 


Phe 


Gin 


Gly 


Thr 


Arg 


Lys 


He 


Lys 


Met 


Arg Arg 


Arg 


Phe 


Tyr 


Glu. 










1125 








1130 








1135 


Phe 


Tyr 


Ser 


Ala 


Pro 


He 


Ser 


Thr 


Phe 


Trp 


Ser 


Trp 


Thr 


He 


Ser 


Phe 








1140 








1145 








1150 




lie 


Leu 


Phe 


He 


Thr 


Phe 


Phe 


Thr 


Tyr 


Thr 


Leu 


Leu 


Val 


Lys 


Thr 


Pro 






1155 








1160 








1165 






Pro 


Arg 


Pro 


Thr 


Val 


He 


Glu 


Tyr 


He 


Leu 


He 


Ala 


Tyr 


Val 


Ala 


Ala 




1170 








1175 








1180 








Phe 


Gly Leu 


Glu 


Gin 


Val 


Arg 


Lys 


He 


He 


Met 


Ser Asp Ala 


Lys 


Pro 


1185 








1190 








1195 








120 


Phe 


Tyr 


Glu 


Lys 


He 


Arg 


Thr 


Tyr 


Val 


Cys 


Ser 


Phe 


Trp Asn 


Cys 


Val 










1205 








1210 








1215 


Thr 


He 


Leu 


Ala 


He 


He 


Phe 


Tyr 


He 


Val 


Gly 


Phe 


Phe 


Met 


Arg 


Cys 








1220 








1225 








1230 




Phe 


Gly Ser 


Val 


Ala 


Tyr 


Gly Arg 


Val 


He 


Leu 


Ala 


Cys 


Asp 


Ser 


Val 






1235 








1240 








1245 






Leu 


Trp 


Thr 


Met 


Lys 


Leu 


Leu 


Asp 


Tyr 


Met 


Ser 


Val 


His 


Pro 


Lys 


Leu 




1250 








1255 








1260 








Gly 


Pro 


Tyr 


Val 


Thr 


Met 


Ala 


Gly 


Lys 


Met 


He 


Gin 


Asn 


Met 


Ser 


Tyr 


1265 








1270 








1275 








128 


He 


He 


Val 


Met 


Leu 


Val 


Val 


Thr 


Leu 


Leu 


Ser 


Phe 


Gly 


Leu 


Ala 


Arg 










1285 








1290 








1295 


Gin 


Ser 


He 


Thr 


Tyr 


Pro Asp 


Glu 


Thr 


Trp 


His 


Trp 


He 


Leu 


Val 


Arg 








1300 








1305 








1310 




Asn 


He 


Phe 


Leu 


Lys 


Pro 


Tyr 


Phe 


Met 


Leu 


Tyr 


Gly Glu Val 


Tyr 


Ala 
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1315 1320 1325 

Asp Glu lie Asp Thr Cys Gly Asp Glu Ala Trp Asp Gin His Leu Glu 

1330 1335 1340 

Asn Gly Gly Pro Val lie Leu Gly Asn Gly Thr Thr Gly Leu Ser Cys 
1345 1350 1355 136 

Val Pro Gly Tyr Trp lie Pro Pro Leu Leu Met Thr Phe Phe Leu Leu 

1365 1370 1375 

lie Ala Asn lie Leu Leu Met Ser Met Leu lie Ala lie Phe Asn His 

1380 1385 1390 

lie Phe Asp Ala Thr Asp Glu Met Ser Gin Gin lie Trp Leu Phe Gin 

1395 1400 1405 

Arg Tyr Lys Gin Val Met Glu Tyr Glu Ser Thr Pro Phe Leu Pro Pro 

1410 1415 1420 

Pro Leu Thr Pro Leu Tyr His Gly Val Leu lie Leu Gin Phe Val Arg 
1425 1430 1435 144 

Thr Arg Leu Ser Cys Ser Lys Ser Gin Glu Arg Asn Pro lie Leu Leu 

1445 1450 1455 

Leu Lys lie Ala Glu Leu Phe Leu Asp Asn Asp Gin lie Glu Lys Leu 

1460 1465 1470 

His Asp Phe Glu Glu Asp Cys Met Glu Asp Leu Ala Arg Gin Lys Leu 

1475 1480 1485 

Asn Glu Lys Asn Thr Ser Asn Glu Gin Arg lie Leu Arg Ala Asp lie 

1490 1495 1500 

Arg Thr Asp Gin lie Leu Asn Arg Leu lie Asp Leu Gin Ala Lys Glu 
1505 1510 1515 152 

Ser Met Gly Arg Asp Val lie Asn Asp Val Glu Ser Arg Leu Ala Ser 

1525 1530 1535 

Val Glu Lys Ala Gin Asn Glu He Leu Glu Cys Val Arg Ala. Leu Leu 

1540 1545 1550 

Asn Gin Asn Asn Ala Pro Thr Ala He Gly Arg Cys Phe Ser Pro Ser 

1555 1560 1565 

Pro Asp Pro Leu Val Glu Thr Ala Asn Gly Thr Pro Gly Pro Leu Leu 

1570 1575 1580 

Leu Lys Leu Pro Gly Thr Asp Pro He Leu Glu Glu Lys Asp His Asp 
1585 1590 1595 160 

Ser Gly Glu Asn Ser Asn Ser Leu Pro Pro Gly Arg He Arg Arg Asn 

1605 1610 1615 

Arg Thr Ala Thr He Cys Gly. Gly Tyr Val Ser Glu Glu Arg Asn Met 

1620 1625 1630 

Met Leu Leu Ser Pro Lys Pro Ser Asp Val Ser Gly He Pro Gin Gin 

1635 1640 1645 

Arg Leu Met Ser Val Thr Ser Met Asp Pro Leu Pro Leu Pro Leu Ala 

1650 1655 1660 

Lys Leu Ser Thr Met Ser He Arg Arg Arg His Glu Glu Tyr Thr Ser 
1665 1670 1675 168 

He Thr Asp Ser He Ala He Arg His Pro Glu Arg Arg He Arg Asn 

1685 1690 1695 

Asn Arg Ser Asn Ser Ser Glu His Asp Glu Ser Ala Val Asp Ser Glu 

1700 1705 1710 

Gly Gly Gly Asn Val Thr Ser Ser Pro Arg Lys Arg Ser Thr Arg Asp 

1715 1720 1725 

Leu Arg Met Thr Pro Ser Ser Gin Val Glu Glu Ser Thr Ser Arg Asp 

1730 1735 1740 

Gin He Phe Glu He Asp His Pro Glu His Glu Glu Asp Glu Ala Gin 
1745 1750 1755 176 

Ala Asp Cys Glu Leu Thr Asp Val He Thr Glu Glu Glu Asp Glu Glu 

1765 1770 1775 

Glu Asp Asp Glu Glu Asp Asp Ser His Glu Arg His His He His Pro 

1780 1785 1790 

Arg Arg Lys Ser Ser Arg Gin Asn Arg Gin Pro Ser His Thr Leu Glu 
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1795 1800 1805 

Thr Asp Leu Ser Glu Gly Glu Glu Val Asp Pro Leu Asp Val Leu Lys 

1810 1815 1820 

Met Lys Glu Leu Pro lie lie His Gin lie Leu Asn Glu Glu Glu Gin 
1825 1830 1835 184 

Ala Gly Ala Pro His Ser Thr Pro Val lie Ala Ser Pro Ser Ser Ser 

1845 1850 1855 

Arg Ala Asp Leu Thr Ser Gin Lys Cys Ser Asp Val 
1860 1865 



<210> 16 
<211> 489 
<212> DNA 
' <213> Mus Musculus 



<400> 16 

ccctgaaaga ctcgacttct gctgctagcg ctggagctga gttagttttg agaaggtttc 60 

ccggggctgt ccttgttcgg tggcccgtgc caccgcctcc ggagacgctt tccgatagat 120 

ggctgcaggc cgcggaggtg gaggaggagc cgctgccctt ccggagtccg ccccgtgagg 180 

agaatgtccc agaaatcctg gatagagagc acttitgacca agagggagtg tgtatatatt 240 

ataccaagct ccaaagaccc tcacagatgt cttccaggat gtcagatttg tcagcaactt 300 

gtcagatgtt tctgtggtcg tttggtcaag caacatgcat gctttactgc aagtcttgcc 360 

atgaaatact cagatgtgaa attgggtgaa cactttaacc aggcaataga agaatggtct 420 

gtggaaaagc acacggagca gagcccaaca gatgcttatg gagtcatcaa ttttcaaggg 4 80 

ggttctcat 489 



<210> 17 

<211> 102 

<212> PRT 

<213> Mus Musculus 





<400> 


17 


























Met 


Ser 


Gin 


Lys 


Ser 


Trp 


He 


Glu 


Ser 


Thr 


Leu 


Thr 


Lys 


Arg 


Glu 


Cys 


1 








5 










10 










15 




Val 


Tyr 


lie 


He 


Pro 


Ser 


Ser 


Lys 


Asp 


Pro 


His 


Arg 


Cys 


Leu 


Pro 


Gly 








20 










25 










30 






Cys 


Gin 


lie 


Cys 


Gin 


Gin 


Leu 


Val 


Arg 


Cys 


Phe 


Cys 


Gly 


Arg 


Leu 


Val 






35 










40 










•4 5 








Lys 


Gin 


His 


Ala 


Cys 


Phe 


Thr 


Ala 


Ser 


Leu 


Ala 


Met 


Lys 


Tyr 


Ser 


Asp 




50 










55 










60 










Val 


Lys 


Leu 


Gly 


Glu 


His 


Phe 


Asn 


Gin 


Ala 


He 


Glu 


Glu 


Trp 


Ser 


Val 


65 . 










70 










75 










80 


Glu 


Lys 


His 


Thr 


Glu 


Gin 


Ser 


Pro 


Thr 


Asp 


Ala 


Tyr 


Gly 


Val 


He 


Asn 










85 










90 










95 




Phe 


Gin 


Gly 


Gly 


Ser 


His 























100 



<210> 18 

<211> 410 

<212> DNA 

<213> Homo Sapiens 

<220> 

<221> unsure 
<222> (6)... (6) 

<221> unsure 
<222> (58)... (58) 



<221> unsure 
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<222> (89) . . . (89) 

<221> unsure 

<222> (406) . . . (406) 



<400> 18 

gccgcnggag cctgagcgga gggtgtgcgc agcctcgcca gcgggggccc cgggctgngc 60 

cattgcctca ctgagccagc gcctgcctnc tacctcgccg acagctggaa ccagtgcgac 120 

ctagtggctc tcacctgctt cctcctgggc gtgggctgcc ggctgacccc gggtttgtac 180 

cacctgggcc gcactgtcct ctgcatcgac ttcatggttt tcacggtgcg gctgcttcac 240 

atcttcacgg tcaacaaaca gctggggccc aagatcgtca tcgtgagcaa gatgatgaag 300 

gacgtgttct tcttcctctt cttcctcggc gtgtggctgg tagctatggg ttgggccacg 360 

gaggggttcc tgaggccacg ggacagtgac ttcccaagta tcctgncgcc 410 

<210> 19 
<211> 131 
<212> PRT 

<213> Homo Sapiens 



<220> 

<221> UNSURE 
<222> (15) . . . (15) 
<223> UNKNOWN 



<221> UNSURE 
<222> (25) . . . (25) 
<223> UNKNOWN 



<221> UNSURE 

<222> (131) . . . (131) 

<223> UNKNOWN 





<400> 


19 










Ala 


Glu 


Gly 


Val 


Arg 


Ser 


Leu 


Ala 


1 








5 








Cys 


Leu 


Thr 


Glu 
20 


Pro 


Ala 


Pro 


Ala 


Gin 


Cys 


Asp 
35 


Leu 


Val 


Ala 


Leu 


Thr 
40 


Arg 


Leu 
50 


Thr 


Pro 


Gly 


Leu 


Tyr 
55 


His 


Asp 


Phe 


Met 


Val 


Phe 


Thr 


Val 


Arg 


65 










70 






Lys 


Gin 


Leu 


Gly 


Pro 
85 


Lys 


He 


Val 


Val 


Phe 


Phe 


Phe 
100 


Leu 


Phe 


Phe 


Leu 


Trp 


Ala 


Thr 
115 


Glu 


Gly 


Phe 


Leu 


Arg 
120 


lie 


Leu 


Xaa 













130 



Ser 


Gly 


Gly 


Pro 


Gly 


Leu 


Xaa 


His 




10 










15 




Xaa 


Tyr 


Leu 


Ala 


Asp 


Ser 


Trp Asn 


25 










30 






Cys 


Phe 


Leu 


Leu 


Gly 


Val 


Gly 


Cys 










45 








Leu 


Gly 


Arg 


Thr 


Val 


Leu 


Cys 


He 








60 










Leu 


Leu 


His 


He 


Phe 


Thr 


Val 


Asn 






75 










80 


He 


Val 


Ser 


Lys 


Met 


Met 


Lys 


Asp 




90 










95 




Gly 


Val 


Trp 


Leu 


Val 


Ala 


Met 


Gly 


105 










110 






Pro 


Arg 


Asp 


Ser 


Asp 


Phe 


Pro 


Ser 



125 



<210> 20 

<211> 389 

<212> DNA 

<213> Homo Sapiens 

<400> 20 

caaatttttt gttagtacac catctcatcc aaattgcaaa agtcacatgg aaactggaac 60 
caaagatcaa gaaactgttt gctctaaagc tacagaagga gataatacag aatttggagc 120 
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atttgtagga cacagagata gcatggatzt acagaggttt aaagaaacat caaacaagat 180 

aaaaatacta tccaataaca atacttctga aaacactttg aaacgagtga gttctcttgc 240 

tggatttact gactgtcaca gaacttccat tcctgttcat tcaaaacgag aaaagatcag 300 

tagaaggcca tctaccgaag acactcatga agtagattcc aaagcagctt taataccggt 360 

ttgtagattt caactaaaca gatatatat 38 9 



<210> 21 

<211> 415 

<212> DNA 

<213> Homo Sapiens 



<400> 21 

atttctagtt tttcaaattt gccagtcttt ttgaatagta tctccttctt ttctcatgtt 60 

ttatatttaa aactttttta tgtccatcat cactttaaac atacttattt tgtcatctat 120 

aaccaataat tccactatct tatcagaaat caaataccgt ttatgtaagt tgactcccat 180 

gagttctaaa ttgccattgt gaggtcatct tcggttaggc tttaatttgt tgcaaagttg 240 

tgcagctcag ggtcaggaag agtccctcca gaaaggagga tttgttactg tgaatctctt 300 

tgttaactaa cctctttccc cactgaaata acttttttca ataacatgat tttaacaaca 360 

taatctctct atgccagaac agatatatat gaatgtaagt caatattttc ttgag 415 



<210> 22 

<211> 405 

<212> DNA 

<213> Mus Musculus 



<400> 22 

ttattatggc ttatcatgaa aaaccagtcc tgcctcctcc tcttatcatc ctcagccata 60 

tagtttcact gttttgctgt gtatgcaaaa gaagaaagaa agataagact tccgatgggc 120 

caaaactttt cttaacagaa gaagatcaaa agaaactcca tgattttgaa gagcagtgtg 180 

ttgagatgta ctttgatgag aaagatgaca aattcaattc tgggagtgaa gagagaatcc 240 

gggtcacttt tgaaagagtg gagcagatga gcattcagat taaagaagtt ggagatcgtg 300 

tcaactacat aaaaagatca ttacagtctt tagattctca aattggtcat ctgcaagatc 360 

tctcagccct aacagtagat acattgaaaa cacttacagc ccaga 405 



<210> 23 

<211> 5117 

<212> DNA 

<213> Homo Sapiens 

<220> 

<221> unsure 

<222> (2382) . . . (2382) 

<223> unknown 



<221> unsure 

<222> (4664) . . . (4664) 

<223> unknown 



<221> unsure 

<222> (4682) . . . (4682) 

<223> unknown 



<221> unsure 

<222> (4702) . . . (4702) 

<223> unknown 



<221> unsure 

<222> (5038) . . . (5039) 

<223> unknown 
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<221> unsure 

<222> (5056) . . . (5056) 

<223> unknown 

<221> unsure 

<222> (5071) . . . (5072) 



<400> 23 

gatggcaaca tggtgaagaa tcaatggcta aagcattagt tgcctgtaag atctatcgtt 60 

caatggcata tgaagcaaag cagagtgacc tggtagatga tacttcagaa gaactaaaac 120 

agtattccaa tgattttggt cagttggccg ttgaattatt agaacagtcc ttcagacaag 180 

atgaaaccat ggctatgaaa ttgctcactt atgaactgaa gaactggagt aattcaacct 240 

gccttaagtt agcagtttct tcaagactta gaccttttgt agctcacacc tgtacacaaa 300 

tgttgttatc tgatatgtgg atgggaaggc tgaatatgag gaaaaattcc tggtacaagg 360 

tcatactaag cattttagtt ccacctgcca tattgctgtt agagtataaa actaaggctg 420 

aaatgtccca tatcccacaa tctcaagatg ctcatcagat gacaatggat gacagcgaaa 480 

acaactttca gaacataaca gaagagatcc ccatggaagt gtttaaagaa gtacggattt 540 

tggatagtaa tgaaggaaag aatgagatgg agatacaaat gaaatcaaaa aagcttccaa 600 

ttacgcgaaa gttttatgcc ttttatcatg caccaattgt aaaattctgg tttaacacgt 660 

tggcatattt aggatttctg atgctttata catttgtggt tcttgtacaa atggaacagt 720 

taccttcagt tcaagaatgg attgttattg cttatatttt tacttatgcc attgagaaag 780 

tccgtgagat ctttatgtct gaagctiggga aagtaaacca gaagattaaa gtatggttta 840 

gtgattactt caacatcagt gatacaattg ccataatttc tttcttcatt ggatttggac 900 

taagatttgg agcaaaatgg aactttgcaa atgcatatga taatcatgtt tttgtggctg 960 

gaagattaat ttactgtctt aacataatat tttggtatgt gcgtttgcta gattttctag 1020 

ctgtaaatca acaggcagga ccttatgtaa tgatgattgg aaaaatggtg gccaatatgt 1080 

tctacattgt agtgattatg gctcttgtat tacttagttt tggtgttccc agaaaggcaa 1140 

tactttatcc tcatgaagca ccatcttgga ctcttgctaa agatatagtt tttcacccat 1200 

actggatgat ttttggtgaa gtttatgcat acgaaattga tgtgtgtgca aatgattctg 1260 

ttatccctca aatctgtggt cctgggacgt ggttgactcc atttcttcaa gcagtctacc 1320 

tctttgtaca gtatatcatt atggttaatc ttcttattgc atttttcaac aatgtgtatt 1380 

tacaagtgaa ggcaatttcc aatattgtat ggaagtacca gcgttatcat tttattatgg 1440 

cttatcatga gaaaccagtt ctgcctcctc cacttatcat tcttagccat atagtttctc 1500 

tgttttgctg catatgtaag agaagaaaga aagataagac ttccgatgga ccaaaacttt 1560 

tcttaacaga agaagatcaa aagaaacttc atgattttga agagcagtgt gttgaaatgt 1620 

atttcaatga aaaagatgac aaatttcatt ctgggagtga agagagaatt cgtgtcactt 1680 

ttgaaagagt ggaacagatg tgcattcaga ttaaagaagt tggagatcgt gtcaactaca 1740 

taaaaagatc attacaatca ttagattctc aaattggcca tttgcaagat ctttcagccc 1800 

tgacggtaga tacattaaaa acactcactg cccagaaagc gtcggaagct agcaaagttc 1860 

ataatgaaat cacacgagaa ctgagcattt ccaaacactt ggctcaaaac cttattgatg 1920 

atggtcctgt aagaccttct gtatggaaaa agcatggtgt tgtaaataca cttagctcct 1980 

ctcttcctca aggtgatctt gaaagtaata atccttttca ttgtaatatt ttaatgaaag 2040 

atgacaaaga tccccagtgt aatatatttg gtcaagactt acctgcagta ccccagagaa 2100 

aagaatttaa ttttccagag gctggttcct cttctggtgc cttattccca agtgctgttt 2160 

cccctccaga actgcgacag agactacatg gggtagaact cttaaaaata tttaataaaa 2220 

atcaaaaatt aggcagttca tctactagca taccacatct gtcatcccca ccaaccaaat 2280 

tttttgttag tacaccatct cagccaagtt gcaaaagcca cttggaaact ggaaccaaag 2340 

atcaagaaac tgtttgctct aaagctacag aaggagataa tncagaattt ggagcatttg 2400 

taggacacag agatagcatg gatttacaga ggtttaaaga aacatcaaac aagataaaaa 24 60 

tactatccaa taacaatact tctgaaaaca ctttgaaacg agtgagttct cttgctggat 2520 

ttactgactg tcacagaact tccattcctg ttcattcaaa acaagcagaa aaaatcagta 2580 

gaaggccatc taccgaagac actcatgaag tagattccaa agcagcttta ataccggatt 2640 

ggttacaaga tagaccatca aacagagaaa tgccatctga agaaggaaca ttaaatggtc 2700 

tcacttctcc atttaagcca gctatggata caaattacta ttattcagct gtggaaagaa 2760 

ataacttgat gaggttatca cagagcattc catttacacc tgtgcctcca agaggggagc 2820 

ctgtcacagi: gtatcgtttg gaagagagtt cacccaacat actaaataac agcatgtctt 2880 

cttggtcaca actaggcctc tgtgccaaaa tagagttttt aagcaaagag gagatgggag 2940 

gaggtttacg aagagctgtc aaagtacagt gtacctggtc agaacatgat atcctcaaat 3000 

cagggcatct ttatattatc aaatcttttc tt:ccagaggt ggttaataca tggtcaagta 3060 

tttataaaga agatacagtt ctgcatctct gtctgagaga aattcaacaa cagagagcag 3120 
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cacaaaagct 
ggttccttga 
aatgtatgac 
ctaatactct 
gggagttact 
taaaagcaga 
atgcaattaa 
ttccagatct 
cttcagattt 
ttcgtctgat 
aatgttactg 
gcaagttgaa 
atcagtcaga 
taattagtat 
tgttttgatg 
gtccatgcag 
cacgcagcct 
tgatgatatc 
tctaattcta 
cggtatttat 
atagaaccca 
actgttaagt 
acctaactca 
atctgtatct 
tatatcaaat 
ttgaacatga 
anactggcag 
ttgttgaata 
ttttcttttt 
gggagcagta 
gagatgataa 
tgtattactt 
ataaatttat 
gatccctcga 



tacgtttgcc 
agttttcctg 
tggagaattt 
ggaagagatc 
ggtacttgat 
a'gaaaagaga 
aaacttcaga 
gaagaggaat 
gaatcttcag 
gttataatat 
tgtcactttt 
tttgctgact 
aagtccctat 
ttgtaacttt 
agggtgatgc 
attttattgg 
tttggttaaa 
actggttccc 
ggttcacaga 
gaatcagggt 
aattgtgtaa 
agaagaaagc 
ctcgtcaagg 
atagcaacaa 
cagaatgtaa 
atgctgtttg 
attgcagtgc 
ttcaacaata 
ttttcttttt 
taaagcttat 
gctgtttaaa 
aaagatacta 
tgtttncaaa 
ggggccc 



tttaatcaaa 
ctgtattgcc 
agaaaataca 
atgctagcct 
ttgcaaggtg 
tcctgtgata 
gcaaaacatc 
gattatacgc 
cctggaaatt 
taatattact 
ccctcgggag 
ccagcacagt 
aggatagagc 
ttaaagggct 
cctctcttat 
cagctgtttt 
tggcagtcaa 
aactagatgc 
ggtatgagaa 
taggctccat 
aaatgttaat 
catggtaaac 
gaagttacct 
cccagaactt 
acatgaactc 
ggtacttaag 
anccttatgc 
acaagaaaag 
cttttgtttc 
ttgtatttag 
tagtgtttaa 
tatacgtttt 
aaaaaaaccc 
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tgaaacccaa 
attcagcagg 
acaataataa 
ttagccactg 
ttggtgaaaa 
tggtttttgg 
actgtaattc 
ctgataaaat 
ccaccaaaga 
■gaatcattgg 
gaaattgttt 
taaaaggtca 
tggcagctga 
ctttgtatag 
gtggtacaat 
attgccattc 
aattttcctc 
ttgttggcca 
gcctgaactg 
atttaaagat 
tgggtttttt 
tgatacataa 
tttgaggaaa 
acaaacttct 
ttgcatatat 
aaattrattc 
caataaaatg 
cttttcatct 
cttggtacta 
tagtgtatct 
tattgattgg 
gtatatcatt 
nntaaaaaaa 



atccatacca 
acagtggttt 
tggagatgag 
gacttacgaa 
tttgactgac 
cccagcaaat 
ttgctgtaga 
tatatttcct 
atcagaatca 
ttttgcctgc 
ggtaatatag 
atattctttt 
gaaattttaa 
cagaggatct 
accattaacc 
aactagggaa 
agtgtattta 
cgggaaggga 
aagaccattt 
agagccagtt 
aaacattgtt 
cctaaattat 
gttaaagtac 
ccaaagattt 
ttaaaattgt 
agtnggatta 
taatttaaca 
aagttttatg 
attttaattt 
catagataca 
gggtggggag 
aaatctttaa 
aaagggcggc 



tattctccaa 
gctgtggaag 
attattccaa 
tatacaagag 
ccatctgtga 
ctaggagaag 
aagcttaaac 
caggatgagc 
gctaattctg 
acctcacaga 
aaaggtgtat 
gacctgatta 
aggtaattga 
catttgactt 
aaaggtaggt 
atgaagaaat 
gtgtgttcag 
aatgacttgt 
tcaagaggga 
ttttttttaa 
ttatcaagtc 
aaaagcagaa 
ttttttccct 
tattgattgt 
gttggaacat 
tcattatgtg 
gccccagata 
ctttaatttt 
ttatttggaa 
gacaaggcaa 
aaagaaaaag 
aagaaatnna 
ccctctagag 



3180. 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5117 





<210> 


24 


























<211> 


1224 


























<212> 


PRT 


























<213> 


Homo Sapiens 






















<220> 




























<221> 


UNSURE 


























<222> 


(794) . . . 


(794) 






















<223> 


UNKNOWN 


























<400> 


24 
























Trp 


Gin His 


Gly Glu 


Glu 


Ser 


Met 


Ala 


Lys 


Ala 


Leu 


Val 


Ala 


Cys 


Lys 


1 




5 










10 










15 




He 


Tyr Arg 


Ser Met 


Ala 


Tyr 


Glu 


Ala 


Lys 


Gin 


Ser 


Asp 


Leu 


Val 


Asp 






20 








25 










30 






Asp 


Thr Ser 


Glu Glu 


Leu 


Lys 


Gin 


Tyr 


Ser 


Asn 


Asp 


Phe 


Gly 


Gin 


Leu 




35 








40 










45 








Ala 


Val Glu 


Leu Leu 


Glu 


Gin 


Ser 


Phe 


Arg 


Gin 


Asp 


Glu 


Thr 


Met 


Ala 




50 






55 










60 










Met 


Lys Leu 


Leu Thr 


Tyr 


Glu 


Leu 


Lys 


Asn 


Trp 


Ser 


Asn 


Ser 


Thr 


Cys 


65 






70 










75 










80 


Leu 


Lys Leu 


Ala Val 


Ser 


Ser Arg 


Leu Arg 


Pro 


Phe 


Val 


Ala 


His 


Thr 




85 










90 










95 
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Cys 


Thr 


Gin 


Met 


Leu 


Leu 


Ser 


Asp 


Met 


Trp 


Met 


Gly Arg 


Leu 


Asn 


Met 






100 










105 










110 






Arg 


Lys 


Asn 


Ser 


Trp 


Tyr 


Lys 


Val 


He 


Leu 


Ser 


He 


Leu 


Val 


Pro 


Pro 


115 










120 










125 








Ala 


He 


Leu 


Leu 


Leu 


Glu 


Tyr 


Lys 


Thr 


Lys 


Ala 


Glu 


Met 


Ser 


His 


He 




130 










135 










140 










Pro 


Gin 


Ser 


Gin 


Asp 


Ala 


His 


Gin 


Met 


Thr 


Met 


Asp 


Asp 


Ser 


Glu 


Asn 


145 










150 










155 










160 


Asn 


Phe 


Gin 


Asn 


He 


Thr 


Glu 


Glu 


He 


Pro 


Met 


Glu 


Val 


Phe 


Lys 


Glu 










165 










170 










175 




Val 


Arg 


He 


Leu 


Asp 


Ser 


Asn 


Glu 


Gly 


Lys 


Asn 


Glu 


Met 


Glu 


He 


Gin 








180 










185 










190 






Met 


Lys 


Ser 


Lys 


Lys 


Leu 


Pro 


He 


Thr 


Arg 


Lys 


Phe 


Tyr 


Ala 


Phe 


Tyr 






195 










200 










205 








His 


Ala 


Pro 


He 


Val 


Lys 


Phe 


Trp 


Phe 


Asn 


Thr 


Leu 


Ala 


Tyr 


Leu 


Gly 




210 










215 










220 










Phe 


Leu 


Met 


Leu 


Tyr 


Thr 


Phe 


Val 


Val 


Leu 


Val 


Gin 


Met 


Glu 


Gin 


Leu 


225 








230 










235 










240 


Pro 


Ser 


Val 


Gin 


Glu 


Trp 


He 


Val 


He 


Ala 


Tyr 


He 


Phe 


Thr 


Tyr 


Ala 










245 










250 










255 




lie 


Glu 


Lys 


Val 


Arg 


Glu 


He 


Phe 


Met 


Ser 


Glu 


Ala 


Gly 


Lys 


Val 


Asn 






260 










265 










270 






Gin 


Lys 


He 


Lys 


Val 


Trp 


Phe 


Ser 


Asp 


Tyr 


Phe 


Asn 


He 


Ser 


Asp 


Thr 






275 










280 










285 








He 


Ala 


He 


He 


Ser 


Phe 


Phe 


He 


Gly 


Phe 


Gly 


Leu 


Arg 


Phe 


Gly Ala 




290 










295 










300 










Lys 


Trp 


Asn 


Phe 


Ala 


Asn 


Ala 


Tyr 


Asp 


Asn 


His 


Val 


Phe 


Val 


Ala 


Gly 


305 










310 










315 










320 


Arg 


Leu 


He 


Tyr 


Cys 


Leu 


Asn 


He 


He 


Phe 


Trp 


Tyr 


Val 


Arg 


Leu 


Leu 










325 










330 










335 




Asp 


Phe 


Leu 


Ala 


Val 


Asn 


Gin 


Gin 


Ala 


Gly 


Pro 


Tyr 


Val 


Met 


Met 


He 








340 










345 










350 






Gly 


Lys 


Met 


Val 


Ala 


Asn 


Met 


Phe 


Tyr 


He 


Val 


Val 


He 


Met 


Ala 


Leu 






355 










360 










365 








Val 


Leu 


Leu 


Ser 


Phe 


Gly Val 


Pro 


Arg 


Lys 


Ala 


He 


Leu 


Tyr 


Pro 


His 




370 










375 










380 










Glu 


Ala 


Pro 


Ser 


Trp 


Thr 


Leu 


Ala 


Lys 


Asp 


He 


Val 


Phe 


His 


Pro 


Tyr 


385 










390 










395 










400 


Trp 


Met 


He 


Phe 


Gly 


Glu 


Val 


Tyr 


Ala 


Tyr 


Glu 


He 


Asp 


Val 


Cys 


Ala 










405 










410 










415 




Asn 


Asp 


Ser 


Val 


He 


Pro 


Gin 


He 


Cys 


Gly 


Pro 


Gly 


Thr 


Trp 


Leu 


Thr 








420 










425 










430 






Pro 


Phe 


Leu 


Gin 


Ala 


Val 


Tyr 


Leu 


Phe 


Val 


Gin 


Tyr 


He 


He 


Met 


Val 






435 










440 










445 








Asn 


Leu 


Leu 


He 


Ala 


Phe 


Phe 


Asn 


Asn 


Val 


Tyr 


Leu 


Gin 


Val 


Lys 


Ala 




450 










455 










460 










He 


Ser 


Asn 


He 


Val 


Trp 


Lys 


Tyr 


Gin 


Arg 


Tyr 


His 


Phe 


He 


Met 


Ala 


465 










470 










4 75 










480 


Tyr 


His 


Glu 


Lys 


Pro 


Val 


Leu 


Pro 


Pro 


Pro 


Leu 


He 


He 


Leu 


Ser 


His 










485 










490 










495 




He 


Val 


Ser 


Leu 


Phe 


Cys 


Cys 


He 


Cys 


Lys 


Arg 


Arg 


Lys 


Lys 


Asp 


Lys 








500 










505 










510 






Thr 


Ser 


Asp 


Gly 


Pro 


Lys 


Leu 


Phe 


Leu 


Thr 


Glu 


Glu 


Asp 


Gin 


Lys 


Lys 






515 










520 










525 








Leu 


His 


Asp 


Phe 


Glu 


Glu 


Gin 


Cys 


Val 


Glu 


Met 


Tyr 


Phe 


Asn 


Glu 


Lys 




530 










535 










540 










Asp Asp 


Lys 


Phe 


His 


Ser 


Gly 


Ser 


Glu 


Glu 


Arg 


He 


Arg 


Val 


Thr 


Phe 


545 










550 










555 










560 


Glu Arg 


Val 


Glu 


Gin 


Met 


Cys 


He 


Gin 


He 


Lys 


Glu 


Val 


Gly Asp Arg 



565 570 575 
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Val 


Asn 


Tyr 


He 


Lys 


Arg 


Ser 


Leu 


Gin 


Ser 


Leu 


Asp 


Ser 


Gin 


He 


Gly 








580 










585 










590 






His 


Leu 


Gin 


Asp 


Leu 


Ser 


Ala 


Leu 


Thr 


Val 


Asp 


Thr 


Leu 


Lys 


Thr 


Leu 






595 










600 










605 








Thr 


Ala 


Gin 


Lys 


Ala 


Ser 


Glu 


Ala 


Ser 


Lys 


Val 


His 


Asn 


Glu 


He 


Thr 




610 










615 










620 










Arg 


Glu 


Leu 


Ser 


He 


Ser 


Lys 


His 


Leu 


Ala 


Gin 


Asn 


Leu 


He 


Asp 


Asp 


625 










630 










635 










640 


Gly 


Pro 


Val 


Arg 


Pro 


Ser 


Val 


Trp 


Lys 


Lys 


His 


Gly 


Val 


Val 


Asn 


Thr 










645 










650 










655 




Leu 


Ser 


Ser 


Ser 


Leu 


Pro 


Gin 


Gly 


Asp 


Leu 


Glu 


Ser 


Asn 


Asn 


Pro 


Phe 








660 
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670 






His 


Cys 


Asn 


He 


Leu 


Met 


Lys 


Asp 


Asp 


Lys 


Asp 


Pro 


Gin 


Cys 


Asn 


He 






675 










680 










685 








Phe 


Gly 


Gin 


Asp 


Leu 


Pro 


Ala 


Val 


Pro 


Gin 


Arg 


Lys 


Glu 


Phe 


Asn 


Phe 




690 










695 










700 










Pro 


Glu 


Ala 


Gly 


Ser 


Ser 


Ser 


Gly 


Ala 


Leu 


Phe 


Pro 


Ser 


Ala 


Val 


Ser 


705 








710 










715 










720 


Pro 


Pro 


Glu 


Leu 


Arg 


Gin 


Arg 


Leu 


His 


Gly 


Val 


Glu 


Leu 


Leu 


Lys 


He 










725 










730 










735 




Phe 


Asn 


Lys 


Asn 


Gin 


Lys 


Leu 


Gly 


Ser 


Ser 


Ser 


Thr 


Ser 


He 


Pro 


His 






740 










745 










750 






Leu 


Ser 


Ser 


Pro 


Pro 


Thr 


Lys 


Phe 


Phe 


Val 


Ser 


Thr 


Pro 


Ser 


Gin 


Pro 






755 










760 










765 








Ser 


Cys 


Lys 


Ser 


His 


Leu 


Glu 


Thr 


Gly 


Thr 


Lys 


Asp 


Gin 


Glu 


Thr 


Val 




770 










775 










780 










Cys 


Ser 


Lys 


Ala 


Thr 


Glu 


Gly 


Asp 


Asn 


Xaa 


Glu 


Phe 


Gly 


Ala 


Phe 


Val 


785 










790 










795 










800 


Gly 


His 


Arg 


Asp 


Ser 


Met 


Asp 


Leu 


Gin 


Arg 


Phe 


Lys 


Glu 


Thr 


Ser 


Asn 










805 










810 










815 




Lys 


He 


Lys 


He 


Leu 


Ser 


Asn 


Asn 


Asn 


Thr 


Ser 


Glu 


Asn 


Thr 


Leu 


Lys 




820 










825 










830 






Arg 


Val 


Ser 


Ser 


Leu 


Ala 


Gly 


Phe 


Thr 


Asp 


Cys 


His 


Arg 


Thr 


Ser 


He 






835 










840 










845 








Pro 


Val 


His 


Ser 


Lys 


Gin 


Ala 


Glu 


Lys 


He 


Ser 


Arg 


Arg 


Pro 


Ser 


Thr 




850 










855 










860 










Glu 


Asp 


Thr 


His 


Glu 


Val 


Asp 


Ser 


Lys 


Ala 


Ala 


Leu 


He 


Pro 


Asp 


Trp 


865 










870 










875 










880 


Leu 


Gin 


Asp 


Arg 


Pro 


Ser 


Asn 


Arg 


Glu 


Met 


Pro 


Ser 


Glu 


Glu 


Gly 


Thr 










885 










890 










895 




Leu 


Asn 


Gly 


Leu 


Thr 


Ser 


Pro 


Phe 


Lys 


Pro 


Ala 


Met 


Asp 


Thr 


Asn 


Tyr 








900 










905 










910 






Tyr 


Tyr 


Ser 


Ala 


Val 


Glu 


Arg 


Asn 


Asn 


Leu 


Met 


Arg 


Leu 


Ser 


Gin 


Ser 






915 










920 










925 








He 


Pro 


Phe 


Thr 


Pro 


Val 


Pro 


Pro 


Arg 


Gly 


Glu 


Pro 


Val 


Thr 


Val 


Tyr 




930 










935 










940 










Arg 


Leu 


Glu 


Glu 


Ser 


Ser 


Pro 


Asn 


He 


Leu 


Asn 


Asn 


Ser 


Met 


Ser 


Ser 


945 










950 










955 










960 


Trp 


Ser 


Gin 


Leu 


Gly 


Leu 


Cys 


Ala 


Lys 


He 


Glu 


Phe 


Leu 


Ser 


Lys 


Glu 










965 










970 










975 




Glu 


Met 


Gly 


Gly 


Gly 


Leu 


Arg 


Arg 


Ala 


Val 


Lys 


Val 


Gin 


Cys 


Thr 


Trp 








980 










985 










990 






Ser 


Glu 


His 


Asp 


He 


Leu 


Lys 


Ser 


Gly 


His 


Leu 


Tyr 


He 


He 


Lys 


Ser 






995 










1000 








1005 






Phe 


Leu 


Pro 


Glu 


Val 


Val 


Asn 


Thr 


Trp 


Ser 


Ser 


He 


Tyr 


Lys 


Glu 


Asp 




1010 








1015 








1020 








Thr 


Val 


Leu 


His 


Leu 


Cys 


Leu 


Arg 


Glu 


He 


Gin 


Gin 


Gin 


Arg 


Ala 


Ala 


1025 








1030 








1035 








104 


Gin 


Lys 


Leu 


Thr 


Phe 


Ala 


Phe 


Asn 


Gin 


Met 


Lys 


Pro 


Lys 


Ser 


He 


Pro 








1045 








1050 








1055 
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Tyr Ser Pro Arg Phe Leu Glu Val Phe Leu Leu Tyr Cys His. Ser Ala 

1060 1065 1070 

Gly Gin Trp Phe Ala Val Glu Glu Cys Met Thr Gly Glu Phe Arg Lys 

1075 1080 1085 

Tyr Asn Asn Asn Asn Gly Asp Glu lie lie Pro Thr Asn Thr Leu Glu 

1090 1095 1100 

Glu lie Met Leu Ala Phe Ser His Trp Thr Tyr Glu Tyr Thr Arg Gly 
1105 1110 1115 112 

Glu Leu Leu Val Leu Asp Leu Gin Gly Val Gly Glu Asn Leu Thr Asp 

1125 1130 1135 

Pro Ser Val lie Lys Ala Glu Glu Lys Arg Ser Cys Asp Met Val Phe 

1140 1145 1150 

Gly Pro Ala Asn Leu Gly Glu Asp Ala lie Lys Asn Phe Arg Ala Lys 

1155 1160 1165 

His His Cys Asn Ser Cys Cys Arg Lys Leu Lys Leu Pro Asp Leu Lys 

1170 1175 1180 

Arg Asn Asp Tyr Thr Pro Asp Lys lie lie Phe Pro Gin Asp Glu Pro 
1185 1190 1195 120 

Ser Asp Leu Asn Leu Gin Pro Gly Asn Ser Thr Lys Glu Ser Glu Ser 

1205 1210 1215 

Ala Asn Ser Val Arg Leu Met Leu 
1220 



<210> 25 

<211> 2180 

<212> DNA 

<213> Homo Sapiens 



<400> 25 



tcgaggccaa 
tgggcactgt 
gcagtttcct 
gacgctgagg 
gacctctttg 
cgctgcccgc 
gccttctttg 
gccagcacta 
acccgcctca 
gacatggata 
acgccgctgg 
gggcgccggt 
ggcaacgtgg 
ttccagccgg 
ctgtgcgagg 
cccgggcctg 
aaccagtgcg 
ccgggtttgt 
cggctgcttc 
aagatgatga 
ggcgtggcca 
cgcgtcttct 
gtggccctca 
ggggcccagg 
gtcatcttcc 
tacacattcg 
ctcatccggg 
ttgcgcctcc 
gccctcgagc 
gaatcggtgc 
tccgagmgtc 



gaattcggca 
tgctgaacag 
cagctcttgg 
aggcagcacg 
gcgagtgcta 
tctgggggga 
cccaggatgg 
cacccatctg 
tcaccttcag 
gtgtcattaa 
gggtcccgcg 
gcctacgccg 
tcagctacct 
cgccgcccgg 
aactgcgcca 
gccatgcctc 
acctagtggc 
accacctggg 
acatcttcac 
aggacgtgtt 
cggaggggct 
accgtcccta 
tggagcacag 
cgggcacctg 
tgctcgtggc 
gcaaagtaca 
aattccactc 
tgctcaggca 
atttccgggt 
ataaggagaa 
tgaagcgcac 



cgagggcctc 
ggcacagatg 
ggcctgtttg 
gaggaaagac 
tcgcagcagt 
tgccacttgc 
ggtacagtct 
ggccctggtt 
gaaatcagaa 
tggggaaggg 
ccagtcgggc 
ctggttccac 
gctgttcctg 
ctccctggag 
gggcctgagc 
actgagccag 
tctcacctgc 
ccgcactgtc 
ggtcaacaaa 
cttcttcctc 
cctgaggcca 
cctgcagatc 
caactgctcg 
cgtctcccag 
caacatcctg 
gggcaacagc 
tcggcccgcg 
attgtgcagg 
ttacctttct 
ctttctgctg 
gtcccagaag 



gggcaggccc 
gccatgtact 
ctgctccggg 
ctggcgttca 
gaggtgaggg 
ctccagctgg 
ctgctgacac 
ctcgccttct 
gaggagccca 
cctgtcggga 
cgtccgggtt 
ttctggggcg 
ctgcttttct 
ctgctgctct 
ggaggcgggg 
cgcctgcgcc 
ttcctcctgg 
ctctgcatcg 
cagctggggc 
ttcttcctcg 
cgggacagtg 
ttcgggcaga 
tcggagcccg 
tatgccaact 
ctggtcaact 
gatctctact 
ctggccccgc 
cgacccsgga 
aaggaagccg 
gcacgcgcta 
gtggacttgg 



cctggagcga 
tctgggagat 
tgatggcacg 
agtttgaggg 
ctgcccgcct 
ccatgcaagc 
agaagtggtg 
tttgccctcc 
cacgggagga 
cggcggaccc 
gctgcggggg 
cgccggtgac 
cgcgggtgct 
atttctgggc 
gcagcctcgc 
tctacctcgc 
gcgtgggctg 
acttcatggt 
ccaagatcgt 
gcgtgtggct 
acttcccaag 
ttccccagga 
gcttctgggc 
ggctggtggt 
tgctcattgc 
ggaaggcgca 
cctttatcgt 
gcccccagcc 
agcggaagct 
gggacaagcg 
cactgaaaca 



cctgcttctt 
gggttccaat 
cctggagcct 
gatgggcgtt 
cctcctccgt 
tgacgcccgt 
gggagatatg 
actcatctac 
gctagagttt 
agccgagaag 
ccgctgcggg 
catcttcatg 
gctcgtggat 
tttcacgctg 
cagcgggggc 
cgacagctgg 
ccggctgacc 
tttcacggtg 
catcgtgagc 
ggtagcctat 
tatcctgcgc 
ggacatggac 
acaccctcct 
gctgctcctc 
catgttcagt 
gcgttaccgc 
catctcccac 
gtcctccccg 
gctaacgtgg 
ggagagcgac 
gctgggacac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
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atccgcgagt acgaacagcg cctgaaagtg ctggagcggg aggtccagca gtgtacctcg 

gcccccgcac ctggtggcct tgtccttgag gtgagcccca tgtccatctg ggccactgtc 

aggaccacct ttgggagtgt catccttaca aaccacagca tgcccggctc ctcccagaac 

cagtcccagc ctgggaggat caaggcctgg atcccrggcc gttatccatc tggaggctgc 

agggtccttg gggtaacagg gaccacagac ccctcaccac tcacagattc ctcacactgg 
ggaaataaag ccatttcaga 

<210> 26 
<211> 725 
<212> PRT 
<213> Homo Sapiens 

<220> 

<221> UNSURE 
<222> (553) . . . (553) 
<223> UNKNOWN 

<221> UNSURE 
<222> (603) . . . (603) 
<223> UNKNOWN 



<400> 26 



Ser 


Arg 


Pro 


Arg 


He 


Arg 


His 


Glu 


Gly 


Leu 


Gly 


Gin 


Ala 


Pro 


Trp 


Ser 


1 








5 










10 










15 




Asp 


Leu 


Leu 


Leu 


Trp 


Ala 


Leu 


Leu 


Leu 


Asn 


Arg 


Ala 


Gin 


Met 


Ala 


Met 








20 










25 










30 






Tyr 


Phe 


Trp 


Glu 


Met 


Gly 


Ser 


Asn 


Ala 


Val 


Ser 


Ser 


Ala 


Leu 


Gly 


Ala 






35 










40 










45 








Cys 


Leu 


Leu 


Leu 


Arg 


Val 


Met 


Ala 


Arg 


Leu 


Glu 


Pro 


Asp 


Ala 


Glu 


Glu 




50 










55 










60 










Ala 


Ala 


Arg 


Arg 


Lys 


Asp 


Leu 


Ala 


Phe 


Lys 


Phe 


Glu 


Gly 


Met 


Gly 


Val 


65 










70 










75 










80 


Asp 


Leu 


Phe 


Gly 


Glu 


Cys 


Tyr 


Arg 


Ser 


Ser 


Glu 


Val 


Arg 


Ala 


Ala 


Arg 










85 










90 










95 




Leu 


Leu 


Leu 


Arg 


Arg 


Cys 


Pro 


Leu 


Trp 


Gly 


Asp 


Ala 


Thr 


Cys 


Leu 


Gin 








100 










105 










110 






Leu 


Ala 


Met 


Gin 


Ala 


Asp 


Ala 


Arg 


Ala 


Phe 


Phe 


Ala 


Gin 


Asp 


Gly 


Val 






115 










120 










125 








Gin 


Ser 


Leu 


Leu 


Thr 


Gin 


Lys 


Trp 


Trp 


Gly 


Asp 


Met 


Ala 


Ser 


Thr 


Thr 




130 










135 










140 










Pro 


He 


Trp 


Ala 


Leu 


Val 


Leu 


Ala 


Phe 


Phe 


Cys 


Pro 


Pro 


Leu 


He 


Tyr 


145 










150 










155 










160 


Thr 


Arg 


Leu 


He 


Thr 


Phe 


Arg 


Lys 


Ser 


Glu 


Glu 


Glu 


Pro 


Thr 


Arg 


Glu 










165 










170 










175 




Glu 


Leu 


Glu 


Phe 


Asp 


Met 


Asp 


Ser 


Val 


He 


Asn 


Gly 


Glu 


Gly 


Pro 


Val 








180 










185 










190 






Gly 


Thr 


Ala 


Asp 


Pro 


Ala 


Glu 


Lys 


Thr 


Pro 


Leu 


Gly 


Val 


Pro 


Arg 


Gin 






195 










200 










205 








Ser 


Gly 


Arg 


Pro 


Gly 


Cys 


Cys 


Gly 


Gly 


Arg 


Cys 


Gly 


Gly 


Arg 


Arg 


Cys 




210 










215 










220 










Leu 


Arg 


Arg 


Trp 


Phe 


His 


Phe 


Trp 


Gly 


Ala 


Pro 


Val 


Thr 


He 


Phe 


Met 


225 










230 










235 










240 


Gly 


Asn 


Val 


Val 


Ser 


Tyr 


Leu 


Leu 


Phe 


Leu 


Leu 


Leu 


Phe 


Ser 


Arg 


Val 










245 










250 










255 




Leu 


Leu 


Val 


Asp 


Phe 


Gin 


Pro 


Ala 


Pro 


Pro 


Gly 


Ser 


Leu 


Glu 


Leu 


Leu 








260 










265 










270 






Leu 


Tyr 


Phe 


Trp 


Ala 


Phe 


Thr 


Leu 


Leu 


Cys 


Glu 


Glu 


Leu Arg 


Gin 


Gly 






275 










280 










285 








Leu 


Ser 


Gly 


Gly 


Gly 


Gly 


Ser 


Leu 


Ala 


Ser 


Gly 


Gly 


Pro 


Gly 


Pro 


Gly 



290 295 300 



1920 
1980 
2040 
2100 
2160 
2180 
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His 


Ala 


Ser 


Leu 


Ser 


Gin 


Arg 


Leu 


Arg 


Leu 


Tyr 


Leu 


Ala 


Asp 


Ser 


Trp 


305 










310 










315 










"son 


Asn 


Gin 


Cys 


Asp 


Leu 


Val 


Ala 


Leu 


Thr 


Cys 


Phe 


Leu 


Leu 


Gly Val 


Gly 










325 










330 










335 




Cys 


Arg 


Leu 


Thr 


Pro Gly 


Leu 


Tyr 


His 


Leu 


Gly 


Arg 


Thr 


Val 


Leu 


Cys 








340 










34 5 










350 






He 


Asp 


Phe 


Met 


Val 


Phe 


Thr 


Val 


Arg 


Leu 


Leu 


His 


He 


Phe 


Thr 


Val 






355 










360 










365 








Asn 


Lys 


Gin 


Leu 


Gly 


Pro 


Lys 


He 


Val 


He 


Val 


Ser 


Lys 


Met 


Met 


Lys 




370 










375 










380 










Asp 


Val 


Phe 


Phe 


Phe 


Leu 


Phe 


Phe 


Leu 


Gly Val 


Trp 


Leu 


Val 


Ala 


Tyr 


385 










390 










395 










4 00 


Gly 


Val 


Ala 


Thr 


Glu 


Gly 


Leu 


Leu 


Arg 


Pro 


Arg 


Asp 


Ser 


Asp 


Phe 


Pro 










405 










410 










415 




Ser 


He 


Leu 


Arg 


Arg 


Val 


Phe 


Tyr 


Arg 


Pro 


Tyr 


Leu 


Gin 


He 


Phe 


Gly 








420 










425 










430 






Gin 


lie 


Pro 


Gin 


Glu 


Asp 


Met 


Asp 


Val 


Ala 


Leu 


Met 


Glu 


His 


Ser 


Asn 






435 










440 










445 








Cys 


Ser 


Ser 


Glu 


Pro 


Gly 


Phe 


Trp 


Ala 


His 


Pro 


Pro 


Gly Ala 


Gin 


Ala 


450 










455 










460 










Gly 


Thr 


Cys 


val 


Ser 


Gin 


Tyr 


Ala 


Asn 


Trp 


Leu 


Val 


Val 


Leu 


Leu 


Leu 


465 










470 










475 










480 


Val 


lie 


Phe 


Leu 


Leu 


Val 


Ala 


Asn 


He 


Leu 


Leu 


Val 


Asn 


Leu 


Leu 


He 










485 










4 90 










495 




Ala 


Met 


Phe 


Ser 


Tyr 


Thr 


Phe 


Gly 


Lys 


Val 


Gin 


Gly Asn 


Ser 


Asp 


Leu 








500 










505 










510 






Tyr 


Trp 


Lys 


Ala 


Gin Arg 


Tyr 


Arg 


Leu 


He 


Arg 


Glu 


Phe 


His 


Ser Arg 






515 










520 










525 








Pro 


Ala 


Leu 


Ala 


Pro 


Pro 


Phe 


He 


Val 


He 


Ser 


His 


Leu Arg 


Leu 


Leu 




530 










535 










540 










Leu 


Arg 


Gin 


Leu Cys 


Arg 


Arg 


Pro 


Xaa 


Ser 


Pro 


Gin 


Pro 


Ser 


Ser 


Pro 


545 










550 










555 










560 


Ala 


Leu 


Glu 


His 


Phe 


Arg 


Val 


Tyr 


Leu 


Ser 


Lys 


Glu 


Ala 


Glu 


Arg 


Lys 










565 










570 










575 




Leu 


Leu 


Thr 


Trp 


Glu 


Ser 


Val 


His 


Lys 


Glu 


Asn 


Phe 


Leu 


Leu 


Ala 


Arg 








580 










585 










590 






Ala 


Arg 


Asp 


Lys 


Arg 


Glu 


Ser 


Asp 


Ser 


Glu 


Xaa 


Leu 


Lys 


Arg 


Thr 


Ser 






595 










600 










605 








Gin 


Lys 


Val 


Asp 


Leu 


Ala 


Leu 


Lys 


Gin 


Leu 


Gly 


His 


He 


Arg 


Glu 


Tyr 




610 










615 










620 










Glu 


Gin 


Arg 


Leu 


Lys 


Val 


Leu 


Glu 


Arg 


Glu 


Val 


Gin 


Gin 


Cys 


Thr 


Ser 












630 










635 










640 


Ala 


Pro 


Ala 


Pro 


Gly 


Gly 


Leu 


Val 


Leu 


Glu 


Val 


Ser 


Pro 


Met 


Ser 


He 










645 










650 










655 




Trp 


Ala 


Thr 


Val 


Arg 


Thr 


Thr 


Phe 


Gly 


Ser 


Val 


He 


Leu 


Thr 


Asn 


His 








660 










665 










670 






Ser 


Met 


Pro 


Gly 


Ser 


Ser 


Gin 


Asn 


Gin 


Ser 


Gin 


Pro 


Gly 


Arg 


He 


Lys 






675 










680 










685 








Ala 


Trp 


He 


Pro 


Gly 


Arg 


Tyr 


Pro 


Ser 


Gly 


Gly 


Cys 


Arg 


Val 


Leu 


Gly 




690 










695 










700 










Val 


Thr 


Gly 


Thr 


Thr 


Asp 


Pro 


Ser 


Pro 


Leu 


Thr 


Asp 


Ser 


Ser 


His 


Trp 


705 










710 










715 










720 


Gly 


Asn 


Lys 


Ala 


He 

























725 
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<400> 27 

cggggaccga tccagcctcc ggactctage ctaggctttt gcaaaaagct atttaggtga 60 

cactatagaa ggtacgcctg caggtaccgg tccggaattc ccgggtcgac ccacgcgtcc 120 

gcagccccgt cgccggcgga ggcgggcgcg ggcgcgtncc ctgtggccag tcacccggag 180 

gagttggtcg cacaattatg aaagactcgg cttctgctgc tagcgccgga gctgagttag 24 0 

ttctgagaag gtttccctgg gcgttccttg tccggcggcc tctgctgccg cctccggaga 300 

cgcttcccga tagatggcta caggccgcgg aggaggagga ggtggagttg ctgcccttcc 360 

ggagtccgcc ccgtgaggag aatgtcccag aaatcctgga tagaaagcac tttgaccaag 4 20 

agggaatgtg tatatattat accaagttcc aaggaccctc acagatgcct tccaggatgt 4 80 

caaatttgtc agcaactcgt caggtgtttt tgtggtcgct tggtcaagca acatgcttgt 54 0 

tttactgcaa gtcttgccat gaaatactca gatgtgaaat tgggtgacca ttttaatcag 600 

gcaatagaag aatggtctgt ggaaaagcat acagaacaga gcccaacgga tgcttatgga 660 

gtcataaatt ttcaaggggg ttctcattcc tacagagcta agtatgtgag gctatcatat 720 

gacaccaaac ctgaagtcat tctgcaactt ctgcttaaag aatggcaaat ggagttaccc 780 

aaacttgtta tctctgtaca tgggggcatg cagaaatttg agcttcaccc acgaatcaag 840 

cagttgcttg gaaaaggtct tattaaagct gcagttacaa ctggagcctg gattttaact 900 

ggaggagtaa acacaggtgt ggcaaaacat gttggagatg ccctcaaaga acatgcttcc 960 

agatcatctc gaaagatttg cactatcgga atagctccat ggggagtgat tgaaaacaga 1020 

aatgatcttg ttgggagaga tgtggttgct ccttatcaaa ccttattgaa ccccctgagc 1080 

aaattgaatg ttttgaataa tctgcattcc catttcatat tggtggatga tggcactgtt 1140 

ggaaagtatg gggcggaagt cagactgaga agagaacttg aaaaaactat taatcagcaa 1200 
agaattcatg ctaggattgg ccagggtgtc cctgtggtgg cacttatatt tgagggtggg • 1260 

ccaaatgtta tcctcacagt tcttgaatac cttcaggaaa gcccccctgt tccagtagtt 1320 

gtgtgtgaag gaacaggcag agctgcagat ctgctagcgt atattcataa acaaacagaa 1380 

gaaggaggga atcttcctga tgcagcagag cccgatatta tttccactat caaaaaaaca 144 0 

tttaactttg gccagaatga agcacttcat ttatttcaaa cactgatgga gtgcatgaaa 1500 

agaaaggagc ttatcactgt tttccatatt gggtcagatg aacatcaaga tatagatgta 1560 

gcaatactta ctgcactgct aaaaggtact aatgcatctg catttgacca gcttatcctt 1620 

acattggcat gggatagagt tgacattgcc aaaaatcatg tatttgttta tggacagcag 1680 

tggctggttg gatccttgga acaagctatg cttgatgctc ttgtaatgga tagagttgca 1740 

tttgtaaaac ttcttattga aaatggagta agcatgcata aattccttac cattccgaga 1800 

ctggaagaac tttacaacac taaacaaggt ccaactaatc caatgctgtt tcatcttgtt 18 60 

cgagacgtca aacagggaaa tcttcctcca ggatataaga tcactctgat tgatatagga 1920 

cttgttattg aatatctcat gggaggaacc tacagatgca cctatactag gaaacgtttt 1980 

cgattaatat ataatagtct tggtggaaat aatcggaggt ctggccgaaa tacctccagc 204 0 

agcactcctc agttgcgaaa gagtcatgaa tcttttggca atagggcaga taaaaaggaa 2100 

aaaatgaggc ataaccattt cattaagaca gcacagccct tccgaccaaa gattgataca 2160 

gttatggaag aaggaaagaa gaaaagaacc aaagatgaaa ttgtagacat tgatgatcca 2220 

gaaaccaagc gctttcctta tccacttaat gaacttttaa tttgggcttg ccttatgaag 2280 

aggcaggtca tggcccgttt tttatggcaa catggtgaag aatcaatggc taaagcatta 2340 

gttgcctgta agatctatcg ttcaatggca tatgaagcaa agcagagtga cctggtagat 2400 

gatacttcag aagaactaaa acagtattcc aatgattttg gtcagttggc cgttgaatta 24 60 

ttagaacagt ccttcagaca agatgaaacc atggctatga aattgctcac ttatgaactg 2520 

aagaactgga gtaattcaac ctgccttaag ttagcagttt cttcaagact tagacctttt 2580 

gtagctcaca cctgtacaca aatgttgtta tctgatatgt ggatgggaag gctgaatatg 264 0 

aggaaaaatt cctggtacaa ggtcatacta agcattttag ttccacctgc catattgctg 2700 

ttagagtata aaactaaggc tgaaatgtcc catatcccac aatctcaaga tgctcatcag 27 60 

atgacaatgg atgacagcga aaacaacttt cagaacataa cagaagagat ccccatggaa 2820 

gtgtttaaag aagtacggat tttggatagt aatgaaggaa agaatgagat ggagatacaa 2880 

atgaaatcaa aaaagcttcc aattacgcga aagttttatg ccttttatca tgcaccaatt 294 0 

gtaaaattct ggtttaacac gttggcatat ttaggatttc tgatgcttta tacatttgtg 3000 

gttcttgtac aaatggaaca gttaccttca gttcaagaat ggattgttat tgcttatatt 3060 

tttacttatg ccattgagaa agtccgtgag atctttatgt ctgaagctgg gaaagtaaac 3120 

cagaagatta aagtatggtt tagtgattac ttcaacatca gtgatacaat tgccataatt 3180 

tctttcttca ttggatttgg actaagattt ggagcaaaat ggaactttgc aaatgcatat 324 0 

gataatcatg tttttgtggc tggaagatta atttactgtc ttaacataat attttggtat 3300 

gtgcgtttgc tagattttct agctgtaaat caacaggcag gaccttatgt aatgatgatt 3360 

ggaaaaatgg tggccaatat gttctacatt gtagtgatta tggctcttgt attacttagt 3420 

tttggtgttc ccagaaaggc aatactttat cctcatgaag caccatcttg gactcttgct 3480 

aaagatatag tttttcaccc atactggatg atttttggtg aagtttatgc atacgaaatt 354 0 
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gatgtgtgtg caaatgattc tgttatccct caaatctgtg gtcctgggac gtggttgact 3600 

ccatttcttc aagcagtcra cctctttgta cagtatatca ttatggttaa tcttcttatt 3660 

gcatttttca acaatgtgta tttacaagtg aaggcaattt ccaatattgt atggaagtac 3720 

cagcgttatc attttattat ggcttatcat gagaaaccag ttctgcctcc tccacttatc 3780 

attcttagcc atatagtttc tctgttttgc tgcatatgta agagaagaaa gaaagataag 3840 

acttccgatg gaccaaaact tttcttaaca gaagaagatc aaaagaaact tcatgatttt 3900 

gaagagcagt gtgttgaaat gtatttcaat gaaaaagatg acaaatttca ttctgggagt 3960 

gaagagagaa ttcgtgtcac ttttgaaaga gtggaacaga tgtgcattca gattaaagaa 4020 

gttggagatc gtgtcaacta cataaaaaga tcattacaat cattagattc tcaaattggc 4080 

catttgcaag atctttcagc cctgacggta gacacattaa aaacactcac tgcccagaaa 4140 

gcgtcggaag ctagcaaagt tcataatgaa atcacacgag aactgagcat ttccaaacac 4200 

ttggctcaaa accttattga tgatggtcct gtaagacctt ctgtatggaa aaagcatggt 4260 

gttgtaaata cacttagctc ctctcttcct caaggtgatc ttgaaagtaa taatcctttt 4 320 

cattgtaata ttttaatgaa agatgacaaa gacccccagt gtaatatatt tggtcaagac 4380 

ttacctgcag taccccagag aaaagaattt aattttccag aggctggttc ctcttctggt 4440 

gccttattcc caagtgctgt ttcccctcca gaactgcgac agagactaca tggggtagaa 4 500 

ctcttaaaaa tatttaataa aaatcaaaaa ttaggcagtt catctactag cataccacat 4 5 60 

ctgtcatccc caccaaccaa attttttgtt agtacaccat ctcagccaag ttgcaaaagc 4 620 

cacttggaaa ctggaaccaa agatcaagaa actgtttgct ctaaagctac agaaggagat 4 680 

aatacagaat ttggagcatt tgtaggacac agagatagca tggatttaca gaggtttaaa 474 0 

gaaacatcaa acaagataaa aatactatcc aataacaata cttctgaaaa cactttgaaa 4800 

cgagtgagtt ctcttgctgg atttactgac tgtcacagaa cttccattcc tgttcattca 4860 

aaacaagcag aaaaaatcag tagaaggcca tc^accgaag acactcatga agtagattcc 4 920 

aaagcagctt taataccgga ttggttacaa gatagaccat caaacagaga aatgccatct 4 980 

gaagaaggaa cattaaatgg tctcacttct ccatttaagc cagctatgga tacaaattac 5040 

tattattcag ctgtggaaag aaataacttg atgaggttat cacagagcat tccatttaca 5100 

cctgtgcctc caagagggga gcctgtcaca gtgtatcgtt tggaagagag ttcacccaac 5160 

atactaaata acagcatgtc ttcttggtca caactaggcc tctgtgccaa aatagagttt 5220 

ttaagcaaag aggagatggg aggaggttta cgaagagctg tcaaagtaca gtgtacctgg 5280 

tcagaacatg atatcctcaa atcagggcat ctttatatta tcaaatcttt tcttccagag 5340 

gtggttaata catggtcaag tatttataaa gaagatacag ttctgcatct ctgtctgaga 5400 

gaaattcaac aacagagagc agcacaaaag cttacgtttg cctttaatca aatgaaaccc 54 60 

aaatccatac catattctcc aaggttcctt gaagttttcc tgctgtattg ccattcagca 5520 

ggacagtggt ttgctgtgga agaatgtatg actggagaat ttagaaaata caacaataat 5580 

aatggagatg agattattcc aactaatact ctggaagaga tcatgctagc ctttagccac 5640 

tggacttacg aatatacaag aggggagtta ctggtacttg atttgcaagg tgttggtgaa 5700 

aatttgactg acccatctgt gataaaagca gaagaaaaga gatcctgtga tatggttttt 5760 

ggcccagcaa atctaggaga agatgcaatt aaaaacttca gagcaaaaca tcactgtaat 5820 

tcttgctgta gaaagcttaa acttccagat ctgaagagga atgattatac gcctgataaa 5880 

attatatttc ctcaggatga gccttcagat ttgaatcttc agcctggaaa ttccaccaaa 5940 

gaatcagaat caactaattc tgttcgtctg atgttataat attaatatta ctgaatcatt 6000 

ggttttgcct gcacctcaca gaaatgttac tgtgtcactt ttccctcggg aggaaattgt 6060 

ttggtaatat agaaaggtgt atgcaagttg aatttgctga ctccagcaca gttaaaaggt 6120 

caatattctt ttgacctgat taatcagtca gaaagtccct ataggataga gctggcagct 6180 

gagaaatttt aaaggtaatt gataattagt atttgtaact ttttaaaggg ctctttgtat 6240 

agcagaggat ctcatttgac tttgttttga tgagggtgat gccctctctt atgtggtaca 6300 

ataccattaa ccaaaggtag gtgtccatgc agattttatt ggcagctgtt ttattgccat 6360 

tcaactaggg aaatgaagaa atcacgcagc cttttggtta aatggcagtc aaaattttcc 6420 

tcagtgtatt tagtgtgttc agtgatgata tcactggttc ccaactagat gcttgttggc 64 80 

cacgggaagg gaaatgactt gttctaattc taggttcaca gaggtatgag aagcctgaac 654 0 

tgaagaccat tttcaagagg gacggtattt atgaatcagg gttaggctcc atatttaaag 6600 

atagagccag tttttttttt aaatagaacc caaattgtgt aaaaatgtta attgggtttt 6660 

ttaaacattg ttttatcaag tcactgttaa gtagaagaaa gccatggtaa actgatacat 6720 

aacctaaatt ataaaagcag aaacctaact cactcgtcaa gggaagttac cttttgagga 6780 

aagttaaagt acttttttcc ctatctgtat ctatagcaac aacccagaac ttacaaactt 684 0 

ctccaaagat tttattgatt gttatatcaa atcagaatgt aaacatgaac tcttgcatat 6900 

atttaaaatt gtgttggaac atttgaacat gaatgctgtt tgggtactta agaaattrat 6960 

tcagtnggat tatcattatg tganactggc agattgcagt gcanccttat gccaataaaa 7020 

tgtaatttar cagccccaga tattgttgaa tattcaacaa ta^icaagaaa agcttttcat 7080 

ctaagtttta tgctttaatt ttttttcttt ttttttcttt ttcttttgtt tccttggtac 7140 
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tataaagctt atttgtattt agtagtgtat 
aagctgttta aatagtgktt aatattgatt 
ttaaagatac tatatacskt ttktatatca 



aaaagaaatn naataaattt attgtttnca aaaaaaaaac ccnntaaaaa 



aaaaagggcg gcccctctag aggatccctc gaggggccc 
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Lys 




1090 








1095 








1100 








Ala 


He 


Ser 


Asn 


He 


Val 


Trp 


Lys 


Tyr 


Gin 


Arg 


Tyr 


His 


Phe 


He 


Met 


1105 








1110 








1115 








1120 


Ala 


Tyr 


His 


Glu 


Lys 


Pro 


Val 


Leu 


Pro 


Pro 


Pro 


Leu 


He 


He 


Leu 


Ser 










1125 








1130 








1135 


His 


He 


Val 


Ser 


Leu 


Phe 


Cys 


Cys 


He 


Cys 


Lys 


Arg 


Arg 


Lys 


Lys 


Asp 








1140 








1145 








1150 




Lys 


Thr 


Ser 


Asp 


Gly 


Pro 


Lys 


Leu 


Phe 


Leu 


Thr 


Glu 


Glu 


Asp 


Gin 


Lys 






1155 








1160 








1165 






Lys 


Leu 


His 


Asp 


Phe 


Glu 


Glu 


Gin 


Cys 


Val 


Glu 


Met 


Tyr 


Phe 


Asn 


Glu 




1170 








1175 








1180 








Lys 


Asp 


Asp 


Lys 


Phe 


His 


Ser 


Gly 


Ser 


Glu 


Glu 


Arg 


He 


Arg 


Val 


Thr 


1185 








1190 








1195 








1200 


Phe 


Glu 


Arg 


Val 


Glu 


Gin 


Met 


Cys 


He 


Gin 


He 


Lys 


Glu 


Val 


Gly 


Asp 










1205 








1210 








1215 


Arg 


Val 


Asn 


Tyr 


He 


Lys 


Arg 


Ser 


Leu 


Gin 


Ser 


Leu 


Asp 


Ser 


Gin 


He 








1220 








1225 








1230 




Gly 


His 


Leu 


Gin 


Asp 


Leu 


Ser 


Ala 


Leu 


Thr 


Val 


Asp 


Thr 


Leu 


Lys 


Thr 






1235 








1240 








1245 






Leu 


Thr 


Ala 


Gin 


Lys 


Ala 


Ser 


Glu 


Ala 


Ser 


Lys 


Val 


His 


Asn 


Glu 


He 




1250 








1255 








1260 








Thr 


Arg 


Glu 


Leu 


Ser 


He 


Ser 


Lys 


His 


Leu 


Ala 


Gin 


Asn 


Leu 


He 


Asp 


1265 








1270 








1275 








1280 


Asp 


Gly 


Pro 


Val 


Arg 


Pro 


Ser 


Val 


Trp 


Lys 


Lys 


His 


Gly Val 


Val 


Asn 










1285 








1290 








1295 


Thr 


Leu 


Ser 


Ser 


Ser 


Leu 


Pro 


Gin Gly Asp 


Leu 


Glu 


Ser 


Asn 


Asn 


Pro 








1300 








1305 








1310 




Phe 


His 


Cys 


Asn 


He 


Leu 


Met 


Lys 


Asp Asp 


Lys 


Asp 


Pro 


Gin 


Cys 


Asn 



1315 1320 1325 
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Ile Phe Gly Gin Asp Leu Pro Ala Val Pro Gin Arg Lys Glu Phe Asn 

1330 1335 1340 

Phe Pro Glu Ala Gly Ser Ser Ser Gly Ala Leu Phe Pro Ser Ala Val 
1345 1350 1355 1360 

Ser Pro Pro Glu Leu Arg Gin Arg Leu His Gly Val Glu Leu Leu Lys 

1365 1370 1375 

lie Phe Asn Lys Asn Gin Lys Leu Gly Ser Ser Ser Thr Ser lie Pro 

1380 1385 1390 

His Leu Ser Ser Pro Pro Thr Lys Phe Phe Val Ser Thr Pro Ser Gin 

1395 1400 1405 

Pro Ser Cys Lys Ser His Leu Glu Thr Gly Thr Lys Asp Gin Glu Thr 

1410 1415 1420 

Val Cys Ser Lys Ala Thr Glu Gly Asp Asn Thr Glu Phe Gly Ala Phe 
1425 1430 1435 1440 

Val Gly His Arg Asp Ser Met Asp Leu Gin Arg Phe Lys Glu Thr Ser 

1445 1450 1455 

Asn Lys lie Lys lie Leu Ser Asn Asn Asn Thr Ser Glu Asn Thr Leu 

1460 1465 1470 

Lys Arg Val Ser Ser Leu Ala Gly Phe Thr Asp Cys His Arg Thr Ser 

1475 1480 1485 

lie Pro Val His Ser Lys Gin Ala Glu Lys lie Ser Arg Arg Pro Ser 

1490 1495 1500 

Thr Glu Asp Thr His Glu Val Asp Ser Lys Ala Ala Leu lie Pro Asp 
1505 1510 1515 1520 

Trp Leu Gin Asp Arg Pro Ser Asn Arg Glu Met Pro Ser Glu Glu Gly 

1525 1530 1535 

Thr Leu Asn Gly Leu Thr Ser Pro Phe Lys Pro Ala Met Asp Thr Asn 

1540 1545 1550 

Tyr Tyr Tyr Ser Ala Val Glu Arg Asn Asn Leu Met Arg Leu Ser Gin 

1555 1560 1565 

Ser lie Pro Phe Thr Pro Val Pro Pro Arg Gly Glu Pro Val Thr Val 

1570 1575 1580 

Tyr Arg Leu Glu Glu Ser Ser Pro Asn lie Leu Asn Asn Ser Met Ser 
1585 1590 1595 1600 

Ser Trp Ser Gin Leu Gly Leu Cys Ala Lys lie Glu Phe Leu Ser Lys 

1605 1610 1615 . 

Glu Glu Met Gly Gly Gly Leu Arg Arg Ala Val Lys Val Gin Cys Thr 

1620 1625 1630 

Trp Ser Glu His Asp lie Leu Lys Ser Gly His Leu Tyr lie lie Lys 

1635 1640 1645 

Ser Phe Leu Pro Glu Val Val Asn Thr Trp Ser Ser lie Tyr Lys Glu 

1650 1655 1660 

Asp Thr Val Leu His Leu Cys Leu Arg Glu lie Gin Gin Gin Arg Ala 
1665 1670 1675 1680 

Ala Gin Lys Leu Thr Phe Ala Phe Asn Gin Met Lys Pro Lys Ser lie 

1685 1690 1695 

Pro Tyr Ser Pro Arg Phe Leu Glu Val Phe Leu Leu Tyr Cys His Ser 

1700 1705 1710 

Ala Gly Gin Trp Phe Ala Val Glu Glu Cys Met Thr Gly Glu Phe Arg 

1715 1720 1725 

Lys Tyr Asn Asn Asn Asn Gly Asp Glu lie lie Pro Thr Asn Thr Leu 

1730 1735 1740 

Glu Glu lie Met Leu Ala Phe Ser His Trp Thr Tyr Glu Tyr Thr Arg 
1745 1750 1755 1760 

Gly Glu Leu Leu Val Leu Asp Leu Gin Gly Val Gly Glu Asn Leu Thr 

1765 1770 1775 

Asp Pro Ser Val lie Lys Ala Glu Glu Lys Arg Ser Cys Asp Met Val 

1780 1785 1790 

Phe Gly Pro Ala Asn Leu Gly Glu Asp Ala lie Lys Asn Phe Arg Ala 
1795 1800 1805 
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Lys His His Cys Asn Ser Cys Cys Arg Lys Leu Lys Leu Pro Asp Leu 

1810 1815 1820 

Lys Arg Asn Asp Tyr Thr Pro Asp Lys lie lie Phe Pro Gin Asp Giu 
1825 1830 1835 1840 

Pro Ser Asp Leu Asn Leu Gin Pro Gly Asn Ser Thr Lys Glu Ser Glu 

1845 1850 1855 

Ser Thr Asn Ser Val Arg Leu Met Leu 
1860 1865 

<210> 29 

<211> 4061 

<212> DNA 

<213> Homo Sapiens 



<400> 
ggtctggaag 
ggccgcggca 
aagaagacct 
tgtgggcgcc 
gtggtgaccg 
gagctggact 
acggatccag 
ctggtggtgt 
ctgctgcgtc 
ggtctgcaca 
agcactgggg 
agagacaccc 
ccggaggacg 
gacggcacac 
atctcacagc 
ctgattgatg 
ccatgtctcc 
gacactctgg 
cgtttctttc 
cggaaggagc 
gttttgaagg 
ctgcgtttgg 
gacatccaat 
cggcctgagt 
ccgatgcgcc 
ttggaccagg 
gagctccggc 
ccgaggtacc 
atgtatctgc 
gccccctgga 
tacttctggg 
cgggtgatgg 
ttcaagtttg 
agggctgccc 
ctggccatgc 
acacagaagt 
ttcttttgcc 
cccacacggg 
gggacggcgg 
ggttgctgcg 
ggcgcgccgg 
ttctcgcggg 
ctctatttct 
gggggcagcc 
cgcctctacc 



29 

cagagccggc 
gcatggtggt 
gcacgacgtt 
cccggaccgc 
tgtgggacag 
tcacgggggc 
ctgcagttta 
cagtgctggg 
gtgggctggt 
cgggcatcgg 
gcaccaaggt 
tcatcaaccc 
gggtccagtt 
acggctgcct 
agaagacggg 
gtgatgagaa 
tcgtggctgg 
ccccagggag 
ccaaagggga 
tcctgacagt 
cccttgtgaa 
ctgtggcttg 
ggcggtcctt 
tcgtgcgctt 
tggcccaact 
cgtcccacag 
cccctgacgt 
cctccggggg 
tctcggacaa 
gcgacctgct 
agatgggttc 
cacgcctgga 
aggggatggg 
gcctcctcct 
aagctgacgc 
ggtggggaga 
ctccactcat 
aggagctaga 
acccagccga 
ggggccgctg 
tgaccatctt 
tgctgctcgt 
gggctttcac 
tcgccagcgg 
tcgccgacag 



ggagggagcg 
gccggagaag 
catagttgac 
ccaccccgca 
cgatgcacac 
cggccgcaag 
tagtctggtc 
gggatcgggg 
gcgggctgcc 
ccggcatgtt 
ggtggccatg 
caagggctcg 
tcccctggac 
ggggggcgag 
cgtgggaggg 
gatgttgacg 
ctcaggggga 
tgggggagcc 
ccttgaggtc 
ctattcttct 
ggcctgtggg 
gaaccgcgtg 
ccatctcgaa 
gctcatttcc 
ctacagcgcg 
cgcaggcacc 
ggggcatgtg 
cgcctgggac 
ggccacctcg 
tctttgggca 
caatgcagtt 
gcctgacgct 
cgttgacctc 
ccgtcgctgc 
ccgtgccttc 
tatggccagc 
ctacacccgc 
gtttgacatg 
gaagacgccg 
cggggggcgc 
catgggcaac 
ggatttccag 
gctgctgtgc 
gggccccggg 
ctggaaccag 



ccggggccct 
gagcagagct 
tccacagatc 
gtggccatgg 
accacggaga 
cacagcaatt 
acacgcacat 
ggccccgtcc 
cagagcacag 
ggtgtggctg 
ggtgtggccc 
ttccctgcga 
tacaactact 
aaccgcttcc 
actggaattg 
cgaatagaga 
gctgcggact 
aggcaaggcg 
ctgcaggccc 
gaggatgggt 
agctcggagg 
gacattgccc 
gcttccctca 
cacggcctca 
gcgccctcca 
aaagccccag 
ctgaggatgc 
cctcacccag 
ccgctctcgc 
ctgttgctga 
tcctcagctc 
gaggaggcag 
tttggcgagt 
ccgctctggg 
tttgcccagg 
actacaccca 
ctcatcacct 
gatagtgtca 
ctgggggtcc 
cggtgcctac 
gtggtcagct 
ccggcgccgc 
gaggaactgc 
cctggccatg 
tgcgacctag 



gggctgcagg 
ggatccccaa 
cgggagggac 
aggatgcctt 
agcccaccga 
tcctccggct 
ggggcttccg 

tccagacctg 
gagcctggat 
tacgggacca 
cctggggtgt 
ggtaccggtg 
cggccttctt 
gcttgcgcct 
acatccctgt 
acgccaccca 
gcctggcgga 
aagcccgaga 
aggtggagag 
ctgaggaatt 
cctcagccta 
agagtgaact 
tggacgccct 
gcctgggcca 
actcgctcat 
ccctaaaagg 
tgctggggaa 
gccagggctt 
tggatgctgg 
acagggcaca 
ttggggcctg 
cacggaggaa 
gctatcgcag 
gggatgccac 
atggggtaca 
tctgggccct 
tcaggaaatc 
ttaatgggga 
cgcgccagtc 
gccgctggtt 
acctgctgtt 
ccggctccct 
gccagggcct 
cctcactgag 
tggctctcac 



aggttgcggc 
gatcttcaag 
cttgtgccag 
cggggcagcc 
tgcctacgga 
ctctgaccga 
tgccccgaac 
gctgcaggac 
tgtcactggg 
tcagatggcc 
ggtccggaat 
gcgcggtgac 
cctggtggac 
ggagtcctac 
cctgctcctc 
ggctcagctc 
gaccctggaa 
tcgaatcagg 
gattatgacc 
cgagaccata 
cctggatgag 
ctttcggggg 
gctgaatgac 
cttcctgacc 
ccgcaacctt 
gggagctgcg 
gatgtgcgcg 
cggggagagc 
cctcgggcag 
gatggccatg 
tttgctgctc 
agacctggcg 
cagtgaggtg 
ttgcctccag 
gtctctgctg 
ggttctcgcc 
agaagaggag 
agggcctgtc 
gggccgtccg 
ccacttctgg 
cctgctgctt 
ggagctgctg 
gagcggaggc 
ccagcgcctg 
ctgcttcctc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
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ctgggcgtgg 
atcgacttca 
gggcccaaga 
ctcggcgtgt 
agtgacttcc 
cagattcccc 
cccggcttct 
aactggctgg 
aacttgctca 
tactggaagg 
ccgcccttta 
cggagccccc 
gccgagcgga 
gctagggaca 
ttggcactga 
cgggaggtcc 
gccttgctgc 
ctgctggcgg 
tctgggcctc 
gggccactgt 
cctcccagaa 
ctggaggctg 
cctcacactg 
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gctgccggct 
tggttttcac 
tcgtcatcgt 
ggctggtagc 
caagtatcct 
aggaggacat 
gggcacaccc 
tggtgctgct 
ttgccatgtt 
cgcagcgtta 
tcgtcatctc 
agccgtcctc 
agctgctaac 
agcgggagag 
aacagctggg 
agcagtgtag 
ccccaggtgg 
acttcaagga 
ggcccccgca 
caggaccacc 
ccagtcccag 
cagggtcctt 
gggaaataaa 



gaccccgggt 
ggtgcggctg 
gagcaagatg 
ctatggcgtg 
gcgccgcgtc 
ggacgtggcc 
tcctggggcc 
cctcgtcatc 
cagttacaca 
ccgcctcatc 
ccacttgcgc 
cccggccctc 
gtgggaatcg 
cgactccgag 
acacatccgc 
ccgcgtcctg 
gccgccaccc 
gaagccccca 
cctggtggcc 
tttgggagtg 
cctgggagga 
ggggtaacag 
gccatttcag 
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ctgtaccacc 
cttcacatct 
atgaaggacg 
gccacggagg 
ttctaccgtc 
ctcatggagc 
caggcgggca 
ttcctgctcg 
ttcggcaaag 
cgggaattcc 
ctcctgctca 
gagcatttcc 
gtgcataagg 
cgtctgaagc 
gagtacgaac 
gggtgggtgg 
cctgacctgc 
caggggattt 
ttgtccttga 
tcatccttac 
tcaaggcctg 
ggaccacaga 
aggaaaaaaa 



tgggccgcac 
tcacggtcaa 
tgttcttctt 
ggctcctgag 
cctacctgca 
acagcaactg 
cctgcgtctc 
tggccaacat 
tacagggcaa 
actctcggcc 
ggcaattgtg 
gggtttacct 
agaactttct 
gcacgtccca 
agcgcctgaa 
ccgaggccct 
ctgggtccaa 
tgctcctaga 
ggtgagcccc 
aaaccacagc 
gatcccgggc 
cccctcacca 
a 



tgtcctctgc 
caaacagctg 
cctcttcttc 
gccacgggac 
gatcttcggg 
ctcgtcggag 
ccagtatgcc 
cctgctggtc 
cagcgatctc 
cgcgctggcc 
caggcgaccc 
ttctaaggaa 
gctggcacgc 
gaaggtggac 
agtgctggag 
gagccgctct 
agactgagcc 
gtaaggctca 
atgtccatct 
atgcccggct 
cgttatccat 
ctcacagatt 



2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4061 





<210> 


30 




























<211> 


1214 


























<212> 


PRT 




























<213> 


Homo Sapiens 






















<400> 


30 


























Met 


Val 


Val 


Pro 


Glu 


Lys 


Glu 


Gin 


Ser 


Trp 


He 


Pro 


Lys 


He 


Phe 


Lys 


1 








5 










10 










15 




Lys 


Lys 


Thr 


Cys 


Thr 


Thr 


Phe 


He 


Val 


Asp 


Ser 


Thr 


Asp 


Pro 


Gly 


Gly 








20 










25 










30 






Thr 


Leu 


Cys 


Gin 


Cys 


Gly 


Arg 


Pro 


Arg 


Thr 


Ala 


His 


Pro 


Ala 


Val 


Ala 






35 










40 










45 








Met 


Glu 


Asp 


Ala 


Phe 


Gly 


Ala 


Ala 


Val 


Val 


Thr 


Val 


Trp 


Asp 


Ser 


Asp 




50 










55 










60 










Ala 


His 


Thr 


Thr 


Glu 


Lys 


Pro 


Thr 


Asp 


Ala 


Tyr 


Gly 


Glu 


Leu 


Asp 


Phe 


65 










70 










75 










80 


Thr 


Gly 


Ala 


Gly 


Arg 


Lys 


His 


Ser 


Asn 


Phe 


Leu 


Arg 


Leu 


Ser 


Asp 


Arg 










85 










90 










95 




Thr 


Asp 


Pro 


Ala 


Ala 


Val 


Tyr 


Ser 


Leu 


Val 


Thr 


Arg 


Thr 


Trp 


Gly 


Phe 








100 










105 










110 






Arg 


Ala 


Pro 


Asn 


Leu 


Val 


Val 


Ser 


Val 


Leu 


Gly 


Gly 


Ser 


Gly 


Gly 


Pro 






115 










120 










125 








Val 


Leu 


Gin 


Thr 


Trp 


Leu 


Gin 


Asp 


Leu 


Leu 


Arg 


Arg 


Gly 


Leu 


Val 


Arg 




130 










135 










140 










Ala 


Ala 


Gin 


Ser 


Thr 


Gly 


Ala 


Trp 


He 


Val 


Thr 


Gly 


Gly 


Leu 


His 


Thr 


145 










150 










155 










160 


Gly 


He 


Gly 


Arg 


His 


Val 


Gly 


Val 


Ala 


Val 


Arg 


Asp 


His 


Gin 


Met 


Ala 










165 










170 










175 




Ser 


Thr 


Gly 


Gly 


Thr 


Lys 


Val 


Val 


Ala 


Met 


Gly 


Val 


Ala 


Pro 


Trp 


Gly 








180 










185 










190 






Val 


Val 


Arg 


Asn 


Arg 


Asp 


Thr 


Leu 


He 


Asn 


Pro 


Lys 


Gly 


Ser 


Phe 


Pro 






195 










200 










205 








Ala 


Arg 


Tyr 


Arg 


Trp 


Arg 


Gly 


Asp 


Pro 


Glu 


Asp 


Gly Val 


Gin 


Phe 


Pro 




210 










215 










220 










Leu 


Asp 


Tyr 


Asn 


Tyr 


Ser 


Ala 


Phe 


Phe 


Leu 


Val 


Asp 


Asp 


Gly 


Thr 


His 


225 










230 










235 










240 
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Gly Cys 


Leu 


Gly 


Gly 


Glu 


Asn 


Arg 










245 








lie 


Ser 


Gin 


Gin 


Lys 


Thr 


Gly 


Val 








260 










Val 


Leu 


Leu 


Leu 


Leu 


He 


Asp 


Gly 






275 










280 


Glu 


Asn 


Ala 


Thr 


Gin 


Ala 


Gin 


Leu 




290 










295 




Gly Gly 


Ala 


Ala 


Asp 


Cys 


Leu 


Ala 


305 










310 






Pro Gly 


Ser 


Gly 


Gly 


Ala 


Arg 


Gin 










325 








Arg 


Phe 


Phe 


Pro 


Lys 


Gly Asp 


Leu 








340 










Ara 


lie 


Met 


Thr 


Arg 


Lys 


Glu 


Leu 






355 










360 


Glv 

w J. jr 


Ser 


Glu 


Glu 


Phe 


Glu 


Thr 


He 




370 










375 




Cys 


Glv 


Ser 


Ser 


Glu 


Ala 


Ser 


Ala 


385 










390 






Val 


Ala 


Trp 


Asn Arg 


Val 


Asp 


He 










405 








Asp 


lie 


Gin 


Trp Arg 


Ser 


Phe 


His 








420 










Leu 


Leu 


Asn 


Asp 


Arg 


Pro 


Glu 


Phe 






435 










440 


Leu 


Ser 


Leu 


Gly 


His 


Phe 


Leu 


Thr 




450 










455 




Ser 


Ala 


Ala 


Pro 


Ser 


Asn 


Ser 


Leu 


465 










470 






Ser 


His 


Ser 


Ala 


Gly 
485 


Thr 


Lys 


Ala 


Glu 


Leu 


Arg 


Pro 


Pro 


Asp 


Val 


Gly 








500 










Lys 


Met 


Cys 


Ala 


Pro 


Arg 


Tyr 


Pro 






515 










520 


Pro 


Gly 


Gin 


Gly 


Phe 


Gly 


Glu 


Ser 




530 










535 




Thr 


Ser 


Pro 


Leu 


Ser 


Leu 


Asp 


Ala 


545 










550 






Asp 


Leu 


Leu 


Leu 


Trp 


Ala 


Leu 


Leu 










565 








Tyr 


Phe 


Trp 


Glu 


Met 


Gly 


Ser 


Asn 








580 










Cys 


Leu 


Leu 


Leu 


Arg 


Val 


Met 


Ala 






595 










600 


Ala 


Ala 


Arg 


Arg 


Lys 


Asp 


Leu 


Ala 




610 










615 




Asp 


Leu 


Phe 


Gly 


Glu 


Cys 


Tyr 


Arg 


625 










630 






Leu 


Leu 


Leu 


Arg 


Arg 


Cys 


Pro 


Leu 










645 








Leu 


Ala 


Met 


Gin 


Ala 


Asp 


Ala 


Arg 








660 










Gin 


Ser 


Leu 


Leu 


Thr 


Gin 


Lys 


Trp 






675 










680 


Pro 


lie 


Trp 


Ala 


Leu 


Val 


Leu 


Ala 




690 










695 




Thr 


Arg 


Leu 


He 


Thr 


Phe 


Arg 


Lys 


705 










710 
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Phe 


Arg 


Leu 


Arg 


Leu 


Glu 


Ser 


Tyr 




250 










255 




Gly 


Gly 


Thr 


Gly 


He 


Asp 


He 


Pro 


265 










270 






Asp 


Glu 


Lys 


Met 


Leu 


Thr 


Arg 


He 










285 








Pro 


Cys 


Leu 


Leu 


Val 


Ala 


Gly 


Ser 








300 










Glu 


Thr 


Leu 


Glu 


Asp 


Thr 


Leu 


Ala 






315 










320 


Gly 


Glu 


Ala 


Arg 


Asp Arg 


He 


Arg 




330 










335 




Glu 


Val 


Leu 


Gin 


Ala 


Gin 


Val 


Glu 


345 










350 






Leu 


Thr 


Val 


Tyr 


Ser 


Ser 


Glu Asp 










365 








Val 


Leu 


Lys 


Ala 


Leu 


Val 


Lys 


Ala 








380 










Tyr 


Leu 


Asp 


Glu 


Leu 


Arg 


Leu 


Ala 






395 










400 


Ala 


Gin 


Ser 


Glu 


Leu 


Phe 


Arg 


Gly 




410 










415 




Leu 


Glu 


Ala 


Ser 


Leu 


Met 


Asp Ala 


425 










430 






Val 


Arg 


Leu 


Leu 


He 


Ser 


His 


Gly 










445 








Pro 


Met 


Arg 


Leu 


Ala 


Gin 


Leu 


Tyr 








4 60 










He 


Arg 


Asn 


Leu 


Leu 


Asp 


Gin 


Ala 






475 










480 


Pro 


Ala 


Leu 


Lys 


Gly 


Gly 


Ala 


Ala 




490 










495 




His 


Val 


Leu 


Arg 


Met 


Leu 


Leu 


Gly 


505 










510 






Ser 


Gly 


Gly 


Ala 


Trp 


Asp 


Pro 


His 










525 








Met 


Tyr 


Leu 


Leu 


Ser 


Asp 


Lys 


Ala 








540 










Gly 


Leu 


Gly Gin 


Ala 


Pro 


Trp 


Ser 






555 










560 


Leu 


Asn 


Arg 


Ala 


Gin 


Met 


Ala 


.Met 




570 










575 




Ala 


Val 


Ser 


Ser 


Ala 


Leu 


Gly 


Ala 


585 










590 






Arg 


Leu 


Glu 


Pro 


Asp 


Ala 


Glu 


Glu 










605 








Phe 


Lys 


Phe 


Glu 


Gly Met 


Gly 


Val 








620 










Ser 


Ser 


Glu 


Val 


Arg 


Ala 


Ala 


Arg 






635 










640 


Trp 


Gly Asp Ala 


Thr 


Cys 


Leu 


Gin 




650 










655 




Ala 


Phe 


Phe 


Ala 


Gin Asp 


Gly Val 


665 










670 






Trp 


Gly Asp Met 


Ala 


Ser 


Thr 


Thr 










685 








Phe 


Phe 


Cys 


Pro 


Pro 


Leu 


He 


Tyr 








700 










Ser 


Glu 


Glu 


Glu 


Pro 


Thr 


Arg 


Glu 






715 










720 
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Glu 


Leu 


Glu 


Phe 


Asp 


Met 


Asp 


Ser 


Val 


He 


Asn 


Gly 


Glu 


Gly 


Pro 


Val 










725 










730 










735 




Gly 


Thr 


Ala 


Asp 


Pro 


Ala 


Glu 


Lys 


Thr 


Pro 


Leu 


Gly 


Val 


Pro 


Arg 


Gin 








740 










745 










750 






Ser 


Gly 


Arg 


Pro 


Gly 


Cys 


Cys 


Gly 


Gly 


Arg 


Cys 


Gly 


Gly 


Arg 


Arg 


Cys 






755 










7 60 










765 








Leu 


Arg 


Arg 


Trp 


Phe 


His 


Phe 


Trp 


Gly 


Ala 


Pro 


Val 


Thr 


He 


Phe 


Met 




770 










775 










780 










Gly 


Asn 


Val 


Val 


Ser 


Tyr 


Leu 


Leu 


Phe 


Leu 


Leu 


Leu 


Phe 


Ser 


Arg 


Val 


785 










790 










795 










800 


Leu 


Leu 


Val 


Asp 


Phe 


Gin 


Pro 


Ala 


Pro 


Pro 


Gly 


Ser 


Leu 


Glu 


Leu 


Leu 










805 










810 










815 




Leu 


Tyr 


Phe 


Trp 


Ala 


Phe 


Thr 


Leu 


Leu 


Cys 


Glu 


Glu 


Leu 


Arg 


Gin 


Gly 








820 










825 










830 






Leu 


Ser 


Gly Gly 


Gly 


Gly 


Ser 


Leu 


Ala 


Ser Gly Gly 


Pro 


Gly 


Pro 


Gly 






835 










840 










845 








His 


Ala 


Ser 


Leu 


Ser 


Gin 


Arg 


Leu 


Arg 


Leu 


Tyr 


Leu 


Ala 


Asp 


Ser 


Trp 




850 










855 










860 










Asn 


Gin 


Cys 


Asp 


Leu 


Val 


Ala 


Leu 


Thr 


Cys 


Phe 


Leu 


Leu 


Gly 


Val 


Gly 


865 










870 










875 










880 


Cys 


Arg 


Leu 


Thr 


Pro 


Gly 


Leu 


Tyr 


His 


Leu Gly Arg 


Thr 


Val 


Leu 


Cys 










885 










890 










895 




He 


Asp 


Phe 


Met 


Val 


Phe 


Thr 


Val 


Arg 


Leu 


Leu 


His 


He 


Phe 


Thr 


Val 






900 










905 










910 






Asn 


Lys 


Gin 


Leu 


Gly 


Pro 


Lys 


He 


Val 


He 


Val 


Ser 


Lys 


Met 


Met 


Lys 






915 










920 










925 








Asp 


Val 


Phe 


Phe 


Phe 


Leu 


Phe 


Phe 


Leu 


Gly 


Val 


Trp 


Leu 


Val 


Ala 


Tyr 




930 










935 










940 










Gly 


Val 


Ala 


Thr 


Glu 


Gly 


Leu 


Leu 


Arg 


Pro Arg 


Asp 


Ser 


Asp 


Phe 


Pro 


945 










950 










955 










960 


Ser 


He 


Leu 


Arg 


Arg 


Val 


Phe 


Tyr Arg 


Pro 


Tyr 


Leu 


Gin 


He 


Phe 


Gly 










965 










970 










975 




Gin 


He 


Pro 


Gin 


Glu 


Asp 


Met 


Asp 


Val 


Ala 


Leu 


Met 


Glu 


His 


Ser 


Asn 








980 










985 










990 






Cys 


Ser 


Ser 


Glu 


Pro 


Gly 


Phe 


Trp 


Ala 


His 


Pro 


Pro 


Gly 


Ala 


Gin 


Ala 






995 










1000 








1005 






Gly 


Thr 


Cys 


Val 


Ser 


Gin 


Tyr 


Ala 


Asn 


Trp 


Leu 


Val 


Val 


Leu 


Leu 


Leu 




1010 








1015 








1020 








Val 


He 


Phe 


Leu 


Leu 


Val 


Ala 


Asn 


He 


Leu 


Leu 


Val 


Asn 


Leu 


Leu 


He 


1025 








1030 








1035 








1040 


Ala 


Met 


Phe 


Ser 


Tyr 


Thr 


Phe 


Gly 


Lys 


Val 


Gin 


Gly 


Asn 


Ser 


Asp 


Leu 










1045 








1050 








1055 


Tyr 


Trp 


Lys 


Ala 


Gin 


Arg 


Tyr 


Arg 


Leu 


He 


Arg 


Glu 


Phe 


His 


Ser 


Arg 








1060 








1065 








1070 




Pro 


Ala 


Leu 


Ala 


Pro 


Pro 


Phe 


He 


Val 


He 


Ser 


His 


Leu 


Arg 


Leu 


Leu 






1075 








1080 








1085 






Leu 


Arg 


Gin 


Leu 


Cys 


Arg 


Arg 


Pro 


Arg 


Ser 


Pro 


Gin 


Pro 


Ser 


Ser 


Pro 




1090 








1095 








1100 








Ala 


Leu 


Glu 


His 


Phe 


Arg 


Val 


Tyr 


Leu 


Ser 


Lys 


Glu 


Ala 


Glu 


Arg 


Lys 


1105 








1110 








1115 








1120 


Leu 


Leu 


Thr 


Trp 


Glu 


Ser 


Val 


His 


Lys 


Glu 


Asn 


Phe 


Leu 


Leu 


Ala 


Arg 










1125 








1130 








1135 


Ala 


Arg 


Asp 


Lys 


Arg 


Glu 


Ser 


Asp 


Ser 


Glu Arg 


Leu 


Lys 


Arg 


Thr 


Ser 








1140 








1145 








1150 




Gin 


Lys 


Val 


Asp 


Leu 


Ala 


Leu 


Lys 


Gin 


Leu 


Gly His 


He 


Arg 


Glu 


Tyr 






1155 








1160 








1165 






Glu 


Gin 


Arg 


Leu 


Lys 


Val 


Leu Glu Arg 


Glu 


Val 


Gin 


Gin 


Cys 


Ser 


Arg 




1170 








1175 








1180 








Val 


Leu 


Gly Trp Val 


Ala 


Glu 


Ala 


Leu 


Ser Arg 


Ser 


Ala 


Leu 


Leu 


Pro 



1185 1190 1195 1200 



wo 00/40614 



PCT/US99/29996 



-49- 



Pro Gly Gly Pro Pro Pro Pro Asp Leu Pro Gly Ser Lys Asp 
1205 1210 



<210> 31 

<211> 4646 

<212> DNA 

<213> Homo Sapiens 



<400> 
tcgacccacg 
tccgcccacg 
agatgtcctt 
gcacccggac 
acttggtgaa 
attccaaggc 
gcacccagat 
ccgacgcctt 
tgtcctgcga 
aaacacccaa 
gcatgcgcaa 
tcacgggagg 
ccatcagcag 
tctccaaccg 
accttatgga 
tgctgctcgt 
agctagagaa 
ccattgtgtg 
tcaaaaataa 
ctagcctggt 
gctttttacc 
ggctcaaaga 
gggatgaaat 
agcaagacaa 
acttagccaa 
aagtcatgtt 
atggcttgaa 
acttcagcac 
tcctcacgtt 
gaaatggccg 
ccctgcaagc 
tttgggagca 
ctctggccaa 
agtacgagac 
cagaacagct 
cggtggaggc 
agcaatggta 
ttattatacc 
acaagaagct 
ggaatgtggt 
tccattcggt 
gtgatgaagt 
tgatggacac 
ctaataaaag 
ctctaagatt 
tgcagaggat 
cctttggcgt 
tccgttcggt 
atggtaccac 
gtgtggagct 
tgtgcatcta 



31 

cgtccgccca 
cgtccggggt 
tcgggcagcc 
cctgtactcc 
ttttattcaa 
cacggagaat 
caaccaaagt 
tggggatatt 
cacggacgcg 
cctggtcatt 
gatcttcagc 
cacccattat 
gagttcagag 
ggacaccctc 
tgacttcaca 
ggacaatggc 
gtatatctct 
ttttgcccaa 
aattccttgt 
ggaggtggag 
ccgcacggtg 
aattctcgaa 
tgtgagcaat 
ggataactgg 
tgatgagatt 
tacggctctc 
cctacggaag 
gcttgtgtac 
tgtctggaaa 
ggacgagatg 
tctcttcatc 
gaccaggggc 
agtgaagaac 
ccgggctgtt 
gctggtctat 
cacagaccag 
tggagagatt 
cttggtgggc 
gctttggtac 
cttctacatc 
gccacacccc 
'gagacagtgg 
gctggggctt 
ctctttgtat 
gatccacatt 
gctgatcgat 
ggccaggcaa 
catctacgag 
gtatgacttt 
ggatgagcac 
catgttatcc 



cgcgtccgcc 
gaaagmramy 
aggctcagca 
agcgcgtctc 
gcaaatttta 
gtgtgcaagt 
gagaaatgga 
cagtttgaga 
gaaatccttt 
tctgtgaccg 
cggctcatct 
ggcctgatga 
gagaatattg 
atcaggaatt 
agagatccac 
tgtcatggac 
gagcgcacta 
ggaggtggaa 
gtggtggtgg 
gatgccctga 
tcccggctgc 
tgttctcacc 
gccatctcct 
aatgggcagc 
ttcaccaatg 
ataaaggaca 
tttctcaccc 
cggaatctgc 
ctggttgcga 
gacatagaac 
tgggccattc 
tgcactctgg 
gacatcaatg 
gagctgttca 
tcctgtgaag 
catttcatcg 
tcccgagaca 
tgtggctttg 
tatgtggcgt 
gccttcctcc 
cccgagctgg 
tacgtaaatg 
ttttacttca 
tctggacgag 
tttactgtaa 
gtgttcttct 
gggatcctta 
ccctacctgg 
gcccactgca 
aacctgcccc 
accaacatcc 



cacgcgtccg 
cmygcktsms 
tgaggaacag 
ggagcacaga 
agaaacgaga 
gtggctatgc 
actacaagaa 
cactggggaa 
acgagctgct 
ggggcgccaa 
acatcgcgca 
agtacatcgg 
tggccattgg 
gcgatgctga 
tgtgtatcct 
atcccactgt 
ttcaagattc 
aagagacttt 
aaggctcggg 
catcttctgc 
ctgaggagga 
tattaacagt 
acgctctata 
tgaagcttct 
accgccgatg 
gacccaagtt 
atgatgtcct 
agatcgccaa 
acttccgaag 
tccacgacgt 
ttcagaataa 
cagccctggg 
ctgctgggga 
ctgagtgtta 
cttggggtgg 
cccagcctgg 
ccaagaactg 
tatcatttag 
tcttcacctc 
tgctgtttgc 
tcctgtactc 
gggtgaatta 
tagcaggaat 
tcattttctg 
gcagaaactt 
tcctgttcct 
ggcagaatga 
ccatgttcgg 
ccttcactgg 
ggttccccga 
tgctggtcaa 



cccacgcgtc 
aaaaaccgtc 
aaggaatgac 
cttgtcttac 
atgtgtcttc 
ccagagccag 
acacaccaag 
gaaagggaag 
gacccagcac 
gaacttcgcc 
gtccaaaggt 
ggaggtggtg 
catagcagct 
gggctatttt 
ggacaacaac 
cgaagcaaag 
caactatggt 
gaaagccatc 
ccagatcgct 
cgtcaaggag 
gactgagagt 
tattaaaatg 
caaagccttc 
gctggagtgg 
ggagtctgct 
tgtccgcctc 
cactgaactc 
gaattcctat 
aggcttccgg 
gtctcctatt 
gaaggaactc 
agccagcaag 
gtccgaggag 
cagcagcgat 
aagcaactgt 
ggtccagaat 
gaagattatc 
gaagaaacct 
ccccttcgtg 
ctacgtgctg 
gctggtcttt 
ttttactgac 
tgtatttcgg 
tctggactac 
aggacccaag 
ctttgcggtg 
gcagcgctgg 
ccaggtgccc 
gaatgagtcc 
gtggatcacc 
cctgctggtc 



cgcccacgcg 
acttaggaaa 
actctggaca 
agtgaaagcg 
tttaccaaag 
cacatggaag 
gaatttccta 
tatatacgtc 
tggcacctga 
ctgaagccgc 
gcttggattc 
agagataaca 
tggggcatgg 
ttagcccagt 
cacacacatt 
ctccggaatc 
ggcaagatcc 
aatacctcca 
gatgtgatcg 
aagctggtgc 
tggatcaaat 
gaagaagctg 
agcaccagtg 
aaccagctgg 
gaccttcaag 
tttctggaga 
ttctccaacc 
aatgatgccc 
aaggaagaca 
actcggcacc 
tccaaagtca 
cttctgaaga 
ctggctaatg 
gaagacttgg 
ctggagctgg 
tttctttcta 
ctgtgtctgt 
gtcgacaagc 
gtcttctcct 
ctcatggatt 
gtcctcttct 
ctgtggaatg 
ctccactctt 
attattttca 
attataatgc 
tggatggtgg 
aggtggatat 
agtgacgtgg 
aagccactgt 
atccccctgg 
gccatgtttg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
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gctacacggt gggcaccgtc caggagaaca atgaccaggt ctggaagttc cagaggtact 3120 

tcctggtgca ggagtactgc agccgcctca atatcccctt ccccttcatc gtcttcgctt 3180 

acttctacat ggtggtgaag aagtgcttca agtgttgctg caaggagaaa aacatggagt 3240 

cttctgtctg ctgtttcaaa aatgaagaca atgagactct ggcatgggag ggtgtcatga 3300 

aggaaaacta ccttgtcaag atcaacacaa aagccaacga cacctcagag gaaatgaggc 3360 

atcgatttag acaactggat acaaagctta atgatctcaa gggtcttctg aaagagattg 3420 

ctaataaaat caaataaaac tgtatgaact ctaatggaga aaaatctaat tatagcaaga 3480 

tcatattaag gaatgctgat gaacaatttt gctatcgact actaaatgag agattttcag 3540 

acccctgggt acatggtgga tgattttaaa tcaccctagt gtgctgagac cttgagaata 3600 

aagtgtgtga ttggtttcat acttgaagac ggatataaag gaagaatatt tcctttatgt 3660 

gtttctccag aatggtgcct gtttctctct gtgtctcaat gcctgggact ggaggttgat 3720 

agtttaagtg tgttcttacc gcctcctttt tcctttaatc ttatttttga tgaacacata 3780 

tataggagaa catctatcct atgaataaga acctggtcat gctttactcc tgtattgtta 3840 

ttttgttcat ttccaattga ttctctactt ttcccttttt tgtattatgt gactaattag 3900 

ttggcatatt gtwaaaagtc tctcaaatta ggccagattc taaaacatgc tgcagcaaga 3960 

ggaccccgct ctcttcagga aaagtgtttt catttctcag gatgcttctt acctgtcaga 4020 

ggaggtgaca aggcagtctc ttgctctctt ggact caeca ggctcctatt gaaggaacca 4 080 

cccccattcc taaatatgtg aaaagtcgcc caaaatgcaa ccttgaaagg cactactgac. 4140 

tttgttctta ttggatactc ctcttattta ttatttttcc attaaaaata atagctggct 4200 

attatagaaa atttagacca tacagagatg tagaaagaac ataaattgtc cccattacct 4260 

taaggtaatc actgctaaca atttctggat ggtttttcaa gtctattttt tttctatgta 4 320 

tgtctcaatt ctctttcaaa attttacaga atgttatcat actacatata tactttttat 4380 

gtaagctttt tcacttagta ttttatcaaa tatgttttta ttatattcat agccttctta 4 44 0 

aacattatat caataattgc ataataggca acctctagcg attaccataa ttttgctcat 4500 

tgaaggctat ctccagttga tcattgggat gagcatcttt gtgcatgaat cctattgctg 4560 

tatttgggaa aattttccaa ggttagattc caataaatat ctatttatta ttaaaaaaaa 4 620 

aaaaaaaagg gcggccgctc tagagt 4 64 6 



<210> 32 

<211> 1104 

<212> PRT 

<213> Homo Sapiens 





<400> 


32 










Met 


Ser 


Phe 


Arg 


Ala 


Ala 


Arg 


Leu 


1 








5 








Thr 


Leu 


Asp 


Ser 
20 


Thr 


Arg 


Thr 


Leu 


Asp 


Leu 


Ser 
35 


Tyr 


Ser 


Glu 


Ser 


Asp 
40 


Phe 


Lys 
50 


Lys 


Arg 


Glu 


Cys 


Val 
55 


Phe 


Glu 


Asn 


Val 


Cys 


Lys 


Cys 


Gly 


Tyr 


65 










70 






Thr 


Gin 


He 


Asn 


Gin 
85 


Ser 


Glu 


Lys 


Glu 


Phe 


Pro 


Thr 
100 


Asp 


Ala 


Phe 


Gly 


Lys 


Lys 


Gly 
115 


Lys 


Tyr 


He 


Arg 


Leu 
120 


Leu 


Tyr 
130 


Glu 


Leu 


Leu 


Thr 


Gin 
135 


His 


Val 


lie 


Ser 


Val 


Thr 


Gly 


Gly 


Ala 


145 










150 






Met 


Arg 


Lys 


He 


Phe 
165 


Ser 


Arg 


Leu 


Ala 


Trp 


He 


Leu 
180 


Thr 


Gly 


Gly 


Thr 


Gly 


Glu 


Val 
195 


Val 


Arg 


Asp 


Asn 


Thr 
200 



Ser 


Met 


Arg 


Asn 


Arg 


Arg 


Asn 


Asp 




10 










15 




Tyr 


Ser 


Ser 


Ala 


Ser 


Arg 


Ser 


Thr 


25 










30 






Leu 


Val 


Asn 


Phe 


He 


Gin 


Ala 


Asn 










45 








Phe 


Thr 


Lys 


Asp 


Ser 


Lys 


Ala 


Thr 








60 










Ala 


Gin 


Ser 


Gin 


His 


Met 


Glu 


Gly 






75 










80 


Trp 


Asn 


Tyr 


Lys 


Lys 


His 


Thr 


Lys 




90 










95 




Asp 


He 


Gin 


Phe 


Glu 


Thr 


Leu 


Gly 


105 










110 






Ser 


Cys 


Asp 


Thr 


Asp 


Ala 


Glu 


He 










125 








Trp 


His 


Leu 


Lys 


Thr 


Pro 


Asn 


Leu 








140 










Lys 


Asn 


Phe 


Ala 


Leu 


Lys 


Pro 


Arg 






155 










160 


He 


Tyr 


He 


Ala 


Gin 


Ser 


Lys 


Gly 




170 










175 




His 


Tyr 


Gly 


Leu 


Met 


Lys 


Tyr 


He 


185 










190 






He 


Ser 


Arg 


Ser 


Ser 


Glu 


Glu 


Asn 



205 
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lie 


Val 


Ala 


He 


Gly 


He 




210 










Thr 


Leu 


He 


Arg 


Asn 


Cys 


225 










230 


Leu 


Met 


Asp 


Asp 


Phe 


Thr 










245 




His 


Thr 


His 


Leu 


Leu 


Leu 








260 






Val 


Glu 


Ala 


Lys 


Leu 


Arg 






275 








Thr 


He 


Gin 


Asp 


Ser 


Asn 




290 










Ala 


Gin 


Gly Gly Gly 


Lys 


305 










310 


Lys 


Asn 


Lys 


He 


Pro 


Cys 










325 




Asp 


Val 


He 


Ala 


Ser 


Leu 






340 






Ala 


Val 


Lys 


Glu 


Lys 


Leu 






355 








Leu 


Pro 


Glu 


Glu 


Glu 


Thr 




370 










Leu 


Glu 


Cys 


Ser 


His 


Leu 


385 










390 


Asp 


Glu 


He. Val 


Ser 


Asn 










405 




Ser 


Thr 


Ser 


Glu 


Gin 


Asp 








420 






Leu 


Leu 


Glu 


Trp 


Asn 


Gin 






435 








Asn 


Asp 


Arg 


Arg 


Trp 


Glu 




450 










Ala 


Leu 


He 


Lys 


Asp 


Arg 


465 










470 


Gly 


Leu 


Asn 


Leu 


Arg 


Lys 










485 




Phe 


Ser 


Asn 


His 


Phe 


Ser 








500 






Lys 


Asn 


Ser 


Tyr 


Asn 


Asp 






515 








Ala 


Asn 


Phe 


Arg 


Arg 


Gly 




530 










Glu 


Met 


Asp 


He 


Glu 


Leu 


545 










550 


Leu 


Gin 


Ala 


Leu 


Phe 


He 










565 




Ser 


Lys 


Val 


He 


Trp 


Glu 








580 






Gly Ala 


Ser 


Lys 


Leu 


Leu 






595 








Asn 


Ala 


Ala 


Gly 


Glu 


Ser 




610 










Ala 


Val 


Glu 
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CHARACTERIZATION OF A CALCIUM CHANNEL FAMILY 

Field of the Invenrion 

This invention relates to nucleic acids coding for a novel family of calcium channel 
polypeptides, the encoded polypeptides, unique fragments of the foregoing, and methods of 
making and using same. 

Background of the Invention 

Calcium channels are membrane-spanning, multi-subunit proteins that facilitate the 
contmlled transport ("flux") of Ca^"*^ ions into and out of cells. Cells throughout the animal 
kingdom, and at least some bacterial, fungal and plant cells, possess one or more types of 
calcium channels. In general, "excitable" cells, such as neurons of the central nervous 
system, peripheral nerve cells, and muscle cells, including those of skeletal muscles, cardiac 
muscles, and venous and arterial smooth muscles, possess voltage-dependent calcium 
channels. In a voltage-dependent calcium channel, the transport of Ca^* ions into and out of 
the cells requires a certain minimal level of depolarization (the difference in potential between 
the inside of the cell bearing the channel and the extracellular enviroimient) with the rate of 
Ca^"*^ cell flux dependent on the difference in potential. In "non-excitable" cells, calcium 
influx is thought to occur predominantly in response to stimuli which cause the release of 
calcium from intracellular stores. This process, termed store operated calcium influx, is not 
well understood. 

Characterization of a particular type of calcium channel by analysis of whole cells is 
complicated by the presence of mixed populations of different types of calcium charmels in 
the majority of cells. Although single-channel recording methods can be used to examine 
individual calcium chaimels, such analysis does not reveal information related to the 
molecular structure or biochemical composition of the channel. Furthermore, in this type of 
analysis, the channel is isolated from other cellular constituents that might be important for 
the chaimers natural functions and pharmacological interactions. To study the calcium 
channel structure-function relationship, large amoimts of pure channel protein are needed. 
However, acquiring large amounts of pure protein is difficult in view of the complex nature of 
these multisubunit proteins, the varying concentrations of calcium channel proteins in tissue 
sources, the presence of mixed populations of calcium channel proteins in tissues, and the 
modifications of the native protein that can occur during the isolation procedure. 
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Summarv^ of the Invention 

The invention is based on the identification of a novel family of calcium channel 
polypeptides and the molecular cloning and partial characterization of a novel member of this 
family that is expressed predominantly in human hematopoietic cells, liver, and kidney. This 
newly identified family of calcium channel polypeptides is designated, "SOC" or "CRAC" or 
"ICRAC", for Store Operated Channels or Calcium Release Activated Channels. Although 
not washing to be bound to any particular theory or mechanism, it is believed that the 
SOC/CRAC calcium channel polypeptides are transmembrane polypeptides that modulate 
Ca^"*" flux "into" and "out of a cell, for example, in certain instances they may be activated 
upon depletion of Ca^"*^ from intracellular calcium stores, allowing Ca^"^ influx into the cell. 
Accordingly, the compositions disclosed herein are believed to be useful for modulating 
calcium transport into and out of such intracellular stores and for the treatment of disorders 
that are characteri2:ed by aberrant calcium transport into and out of such intracellular stores. 
In particular, we believe that the SOC/CRAC calcium channel polypeptides disclosed herein 
play an important role in the influx of extracellular calcium by mediating the refilling of 
intracellular calcium stores following their depletion. Accordingly, we believe that the 
compositions for expressing functional SOC/CRAC calcium channel polypeptides in cells, as 
disclosed herein, are useful for treating patients having conditions that are characterized by 
reduced extracellular calcium influx into their SOC/CRAC-expressing cells. Additionally, the 
compositions of the invention are useful for delivering therapeutic and/or imaging agents to 
cells which preferentially express SOC/CRAC calcium channel polypeptides and, in 
particular, for delivering such agents to hematopoietic cells, liver, heart, spleen, and kidney to 
modulate proliferation and growth of these cells. Moreover, in view of the importance of 
cellular calcium levels to cell viability, we believe that SOC-2/CRAC-1, SOC-3/CRAC-2, and 
SOC-4/CRAC-3 as disclosed herein, and/or other members of the SOC/CRAC family of 
calcium channel polypeptides, represent an ideal target for designing and/or identifying (e.g., 
from molecular libraries) small molecule inhibitors that block lymphocyte proliferation, as 
well as other binding agents that selectively bind to SOC/CRAC polypeptides to which drugs 
or toxins can be conjugated for delivery to SOC/CRAC polypeptide expressing cells. 

The invention is based, in part, on the molecular cloning and sequence analysis of the 
novel SOC/CRAC calcium channel molecules disclosed herein (also referred to as a "SOC- 
2/CRAC-l molecule," a "SOC-3/CRAC-2 molecule." and/or "SOC-4/CRAC-3 molecule") 
that are predominantly expressed in human hematopoietic cells, liver, spleen, heart, and 
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kidney (SOC-2/CRAC-1), kidney and colon (SOC-3/CRAC-2), and prostate. (SOC-4/CRAC-3 
molecule). As used herein, a "SOC/CRAC molecule" embraces a "SOC/CRAC calcium 
channel nucleic acid" (or "SOC/CRAC nucleic acid") and a "SOC/CRAC calcium channel 
polypeptide" (or "SOC/CRAC polypeptide"). Homologs and alleles also are embraced within 
the meaning of a SOC/CRAC calcium channel molecule. 

According to one aspect of the invention, isolated SOC/CRAC nucleic acids which 
code for one or more member(s) of the SOC/CRAC family of calcium channel polypeptides 
or unique fragments thereof are provided. The isolated nucleic acids refer to one or more of 
the following: 

(a) nucleic acid molecules which hybridize under stringent conditions to a nucleic acid 
molecule selected from the group consisting of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, 
SEQ ID NO:7, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27. SEQ ID NO:29, and SEQ ID 
NO:3 1 , and which code for a SOC/CRAC polypeptide; 

(b) deletions, additions and substitutions of (a) which code for a respective 
SOC/CRAC polypeptide; 

(c) nucleic acid molecules that differ from the nucleic acid molecules of (a) or (b) in 
codon sequence due to the degeneracy of the genetic code, and 

(d) complements of (a), (b) or (c). 

The invention in another aspect provides an isolated nucleic acid molecule selected 
from the group consisting of (a) a imique fragment of a nucleic acid molecule selected from 
the group consisting of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID 
NO:23, SEQ ID NO:25, SEQ ID NO:29, and SEQ ID NO:31, (b) complements of (a), 
provided that the unique fragment includes a sequence of contiguous nucleotides which is not 
identical to any sequence selected from a sequence group consisting of (1) sequences having 
the SEQ. ID NOS. or GenBank accession numbers of Table I, (2) complements of (1), and (3) 
fragments of (1) and (2). 

According to yet another aspect of the invention, isolated SOC/CRAC polypeptides 
are provided. The isolated SOC/CRAC polypeptide molecules are encoded by one or more 
SOC/CRAC nucleic acid molecules of the invention. Preferably, the SOC/CRAC polypeptide 
contains one or more polypeptides selected from the group consisting of the polypeptides 
having SEQ. ID Nos. 2, 4, 6, 8, 24, 26, 28, 30, and 32. In other embodiments, the isolated 
polypeptide may be a fragment or variant of the foregoing SOC/CRAC polypeptide molecules 
of sufBcient length to represent a sequence unique within the human genome, and identifying 
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with a polypeptide that functions as a calcium channel, provided that the fragment excludes a 
sequence of contiguous amino acids identified in Table IL and/or excludes a sequence of 
contiguous amino acids encoded for by a nucleic acid sequence identified in Table I. In 
another embodiment, immunogenic fragments of the polypeptide molecules described above 
are provided. 

According to another aspect of the invention, isolated SOC/CRAC binding agents 
(e.g., polypeptides) are provided which selectively bind to a SOC/CRAC molecule (e.g., a 
SOC/CRAC polypeptide encoded by the isolated nucleic acid molecules of the invention). 
Preferably, the isolated binding agents selectively bind to a polypeptide which comprises the 
sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, 
SEQ ID NO:8, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, and SEQ ID 
NO:32, or unique fragments thereof. In the preferred embodiments, the isolated binding 
polypeptides include antibodies and fragments of antibodies (e.g.. Fab, F(ab)2, Fd and 
antibody fragments which include a CDR3 region which binds selectively to a SOC/CRAC 
polypeptide). Preferably, the antibodies for human therapeutic applications are human 
antibodies. 

According to another aspect of the invention, a pharmaceutical composition containing 
a pharmaceutically effective amount of an isolated SOC/CRAC nucleic acid, an isolated 
SOC/CRAC polypeptide, or an isolated SOC/CRAC binding polypeptide in a 
pharmaceutically acceptable carrier also is provided. The pharmaceutical compositions are 
useful in accordance with therapeutic methods disclosed herein. 

According to yet another aspect of the invention, a method for isolating a SOC/CRAC 
molecule is provided. The method involves: 

a) contacting a SOC/CRAC nucleic acid or a SOC/CRAC binding polypeptide with a 
sample that is believed to contain one or more SOC/CRAC molecules, under conditions to 
form a complex of the SOC/CRAC nucleic acid or the SOC/CRAC binding polypeptide and 
the SOC/CRAC molecule; 

b) detecting the presence of the complex; 

c) isolating the SOC/CRAC molecule from the complex; and 

d) determining whether the isolated SOC/CRAC molecule has SOC/CRAC calcium 
channel activit>'. As used herein "SOC/CRAC calciimi channel activity" refers to the 
transport of Ca^"*^ into and out of intracellular stores that is mediated by a SOC/CRAC 
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polypeptide. In general, the SOC/CRAC calcium channel activity is initiated by a reduction 
or depletion of intracellular calcium stores. 

In certain embodiments, the SOC/CRAC nucleic acid is a SOC-2/CRAC-1 nucleic 
acid (e.g., a nucleic acid having SEQ. ID NO. 27, or complements thereof); in certain other 
embodiments, the SOC/CRAC nucleic acid is a SOC-3/CRAC-2 nucleic acid (e.g., a nucleic 
acid having SEQ. ID NO. 29, or complements thereof); in further embodiments, the 
SOC/CRAC nucleic acid is a SOC-4/CRAC-3 nucleic acid (e.g., a nucleic acid having SEQ, 
ID NO. 31, or complements thereof). In yet other embodiments, the SOC/CRAC polypeptide 
is a SOC-2/CRAC-1 binding polypeptide (e.g., an antibody that selectively binds to a SOC- 
2/CRAC-l polypeptide). In yet further embodiments, the SOC/CRAC polypeptide is a SOC- 
3/CRAC-2 binding polypeptide (e.g., an antibody that selectively binds to a SOC-3/CRAC-2 
polypeptide). In some embodiments, the SOC/CRAC polypeptide is a SOC-4/CRAC-3 
binding polypeptide (e.g., an antibody that selectively binds to a SOC-4/CRAC-3 
polypeptide). In the preferred embodiments, the isolated binding polypeptides include 
antibodies and fragments of antibodies (e.g.. Fab, F(ab)2, Fd and antibody fragments which 
include a CDR3 region which binds selectively to a SOC-2/CRAC-1, to a SOC-3/CRAC-2, 
and/or to a SOC-4/CRAC-3 polypeptide). Preferably the isolated binding polypeptides or 
other binding agents selectively bind to a single SOC/CRAC molecule, i.e., are capable of 
distinguishing between different members of the SOC/CRAC family. Accordingly, one or 
more SOC/CRAC binding agents can be contained in a single composition (e.g., a 
pharmaceutical composition) to identify multiple SOC/CRAC molecules in vivo or in vitro. 

According to yet another aspect of the invention, a method for identifying agents 
useful in the modulation of SOC/CRAC calcium channel activity is provided. The method 
involves: 

a) contacting a SOC/CRAC polypeptide with a candidate agent suspected of 
modulating SOC/CRAC calcium channel activity, under conditions sufficient to allow the 
candidate agent to interact selectively with (e.g. bind to) the SOC/CRAC polypeptide; 

b) detecting a Ca^"^ concentration of step (b) associated with the SOC/CRAC calcium 
channel activity of the SOC/CRAC polypeptide in the presence of the candidate agent; and 

c) comparing the Ca^"^ concentration of step (b) with a control Ca^"^ concentration of a 
SOC/CRAC polypeptide in the absence of the candidate agent to determine whether the 
candidate agent modulates (increases or decreases) SOC/CRAC calcium chaimel activity. 
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According to another aspect of the invention, a method for identifying agents useful in 
the modulation of a SOC/CRAC polypeptide kinase activity is provided. The method 
involves: 

a) contacting a SOC/CRAC polypeptide with kinase activity with a candidate agent 
suspected of modulating SOC/CRAC kinase activity, under conditions sufficient to allow the 
candidate agent to interact with the SOC/CRAC polypeptide and modulate its kinase activity; 

b) detecting a kinase activity associated with the SOC/CRAC polypeptide in the 
presence of the candidate agent; and 

c) comparing the kinase activity of step (b) with a control kinase activity of a 
SOC/CRAC polypeptide in the absence of the candidate agent to determine whether the 
candidate agent modulates (increases or decreases) SOC/CRAC kinase activity. In some 
embodiments the SOC/CRAC polypeptide comprises amino acids 999-1180 of the SOC- 
2/CRAC-l polypeptide (SEQ ID NO:24), or a fragment thereof that retains the kinase activity. 

According to yet another aspect of the invention, a method for determining the level of 
expression of a SOC/CRAC polypeptide in a subject is provided. The method involves: 

a) measuring the expression of a SOC/CRAC polypeptide in a test sample, and 

b) comparing the measured expression of the SOC/CRAC polypeptide in the test 
sample to the expression of a SOC/CRAC polypeptide in a control containing a known level 
of expression to determine the level of SOC/CRAC expression in the subject. Expression is 
defined as SOC/CRAC mRNA expression or SOC/CRAC polypeptide expression. Various 
methods can be used to measure expression. The preferred embodiments of the invention 
utilize PCR and Northern blotting for measuring mRNA expression, and monoclonal or 
polyclonal SOC/CRAC antisera as reagents for measuring SOC/CRAC polypeptide 
expression. In preferred embodiments, the SOC/CRAC molecule (nucleic acid and/or 
polypeptide) is SOC-2/CRAC-1. In other preferred embodiments, the SOC/CRAC molecule 
is SOC-3/CRAC-2. In yet further preferred embodiments, the SOC/CRAC molecule is SOC- 
4/CRAC-3. In certain embodiments, the test samples include biopsy samples and biological 
fluids such as blood. The method is useful, e.g., for assessing the presence or absence or 
stage of a proliferative disorder in a subject. 

The invention also contemplates kits comprising a package including assays for 
SOC/CRAC epitopes, SOC/CRAC nucleic acids, and instructions, and optionally related 
materials such as controls, for example, a number, color chart, or an epitope of the expression 
product of the foregoing isolated nucleic acid molecules of the invention for comparing, for 
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example, the level of SOC/CRAC polypeptides or SOC/CRAC nucleic acid forms (\vdld-type 
or mutant) in a test sample to the level in a control sample having a known amount of a 
SOC/CRAC nucleic acid or SOC/CRAC polypeptide. This comparison can be used to assess 
in a subject a risk of developing a cancer or the progression of a cancer. The kits may also 
5 include assays for other known genes, and expression products thereof, associated with, for 
example, proliferative disorders (e.g., BRCA, p53, etc.). In a preferred embodiment, the kit 
comprises a package containing: (a) a binding agent that selectively binds to an isolated 
nucleic acid of the invention or an expression product thereof to obtain a measured test value, 
(b) a control containing a known amount of a SOC/CRAC nucleic acid or a SOC/CRAC 
io polypeptide to obtain a measured control value, and (c) instructions for comparing the 
measured test value to the measured control value to determine the amount of SOC/CRAC 
nucleic acid or expression product thereof in a sample. 

The invention provides isolated nucleic acid molecules, unique fragments thereof, 
expression vectors containing the foregoing, and host cells containing the foregoing. The 
IS invention also provides isolated binding polypeptides and binding agents which bind such 
polypeptides, including antibodies, and pharmaceutical compositions containing any of the 
compositions of the invention. The foregoing can be used, inter alia, in the diagnosis or 
treatment of conditions characterized by the aberrant expression levels eind/or the presence of 
mutant forms of a SOC/CRAC nucleic acid or polypeptide. The invention also provides 
20 methods for identifying agents that alter the function of the SOC/CRAC polypeptide. 

These and other aspects of the invention, as well as various advantages and utilities, 
will be more apparent with reference to the detailed description of the preferred embodiments. 

Brief Description of the Sequences 
SEQ ID NO: 1 is a partial nucleotide sequence of the human SOC-2/CRAC-1 cDNA. 
25 SEQ ID NO:2 is the predicted amino acid sequence of the translation product of 

human SOC-2/CRAC-1 cDNA (SEQ ID NO:l). 

SEQ ID NO:3 is a partial nucleotide sequence of the himian SOC-2/CRAC-1 cDNA. 
SEQ ID NO:4 is the predicted amino acid sequence of the translation product of 
human SOC-2/CRAC-1 cDNA (SEQ ID NO:3). 
so SEQ ID NO:5 is a partial nucleotide sequence of the human SOC-2/CRAC-1 cDNA. 

SEQ ID NO:6 is the predicted amino acid sequence of the translation product of 
human SOC-2/CRAC-1 cDNA (SEQ ID NO:5). 
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SEQ ID NO:7 is a partial nucleotide sequence of the mouse homologue (mSOC- 
2/CRAC-l) of the human SOC-2/CRAC-1 cDNA. 

SEQ ID NO:8 is the predicted amino acid sequence of the translation product of the 
mSOC-2/CRAC-l cDNA (SEQ ID NO:7). 

SEQ ID NO:9 is the nucleotide sequence of the mouse MLSN-1 (SOC-1) cDNA, 

SEQ ID NO: 10 is the predicted amino acid sequence of the translation product of the 
mouse MLSN-1 (SOC-1) cDNA (SEQ ID NO:9). 

SEQ ID NO: 11 is the nucleotide sequence of a human calcium channel cDNA with 
GenBank Acc. no.: AB001535. 

SEQ ID NO: 12 is the predicted amino acid sequence of the translation product of the 
human calcium channel cDNA with GenBank Acc. no.: AB001535 (SEQ ID NO:l 1). 

SEQ ID NO: 13 is the amino acid sequence of a C. Elegans polypeptide at the 
c05cl2.3 locus. 

SEQ ID NO:14 is the amino acid sequence of a C Elegans polypeptide at the F54D1 

locus. 

SEQ ID NO:15 is the amino acid sequence of a C. Elegans polypeptide at the t01H8 

locus. 

SEQ ID NO: 16 is the nucleotide sequence of a mouse kidney cDNA with GenBank 
Acc.no.: AI226731. 

SEQ ID NO: 17 is the predicted amino acid sequence of the translation product of the 
mouse kidney cDNA with GenBank Acc. no.: AI22673 1 (SEQ ID NO: 16). 

SEQ ID NO; 18 is the nucleotide sequence of a human brain cDNA with GenBank 
Acc. no.: HI 8835. 

SEQ ID NO: 19 is the predicted amino acid sequence of the translation product of the 
human brain cDNA with GenBank Acc. no.: H18835 (SEQ ID NO:18). 

SEQ ID NO:20 is the nucleotide sequence of the human EST with GenBank Acc. no.: 
AA4 19592. 

SEQ ID NO:21 is the nucleotide sequence of the human EST with GenBank Acc. no.: 
AA4 19407. 

SEQ ID NO:22 is the nucleotide sequence of the mouse EST with GenBank Acc. no.: 
AI098310. 
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SEQ ID NO:23 is a partial nucleotide sequence of the human SOC-2/CRAC-1 cDNA 
that contains the SOC-2/CRAC-1 sequences of SEQ ID NO:l, SEQ ID NO:3, and SEQ ID 
NO:5. 

SEQ ID NO:24 is the predicted amino acid sequence of the translation product of 
5 human SOC-2/CRAC- 1 cDNA (SEQ ID NO:23). 

SEQ ID NO:25 is a partial nucleotide sequence of the human SOC-3/CRAC-2 cDNA. 
SEQ ID NO:26 is the predicted amino acid sequence of the translation product of 
human SOC-3/CRAC-2 cDNA (SEQ ID NO:25). 

SEQ ID NO:27 is the full nucleotide sequence of the human SOC-2/CRAC-1 cDNA. 
w SEQ ID NO:28 is the predicted amino acid sequence of the translation product of 

human SOC-2/CRAC-1 cDNA (SEQ ID NO:27). 

SEQ ID NO:29 is the ftdl nucleotide sequence of the human SOC-3/CRAC-2 cDNA. 
SEQ ID NO:30 is the predicted amino acid sequence of the translation product of 
human SOC.3/CRAC-2 cDNA (SEQ ID NO:29). 
/J SEQ ID NO:3 1 is the full nucleotide sequence of the himian SOC-4/CRAC-3 cDNA. 

SEQ ID NO:32 is the predicted amino acid sequence of the translation product of 
human SOC-4/CRAC.3 cDNA (SEQ ID NO:31). 

Brief Description of the Drawings 
Figure 1 is a schematic depicting the intron/exon organization of the chicken SOC- 
20 2/CRAC-l genomic sequence, as well as the putative transmembrane (TM) domains, and the 
targeting constructs utilized in the knockout experiments. 

Detailed Description of the Invention 
One aspect of the invention involves the partial cloning of cDNAs encoding members 
of a novel family of calcium channel polypeptides, referred to herein as "SOC/CRAC" 
25 (designated "SOC" or "CRAC" or "ICRAC", for Store Operated Channels or Calcium 
Release Activated Channels, or CECH). Although not intending to be bound to any particular 
mechanism or theory, we believe that a SOC/CRAC family member is a transmembrane 
calcium chaimel that modulates Ca^"*" flux "into" and "out of a cell; in certain instances it may 
be activated upon depletion of Ca^"*" from intracellular calcium stores, allowing Ca^"^ influx 
30 into the cell. 

The first three isolated SOC/CRAC members disclosed herein, define a new family of 
calcium channels which is distinct from previously described calcium channels, such as 
voltage gated calcium channels, ryanodine receptor/inositoH,4,5-triphosphate receptor 
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channels, and Transient Receptor Potential (TRP) channels. The SOC/CRAC family of 
calcium channels exhibits high selectivity (with a Pca/Pwa ratio near 1000), a unitary 
conductance below the detection level of the patch clamp method (the conductance estimated 
at approximately 0.2 picosiemens), and are subject to inhibition by high intracellular calcium 
levels. Although not intending to be bound to any particular mechanism or theory, we believe 
that SOC/CRAC calcium channels are responsible for the majority of, for example, calcium 
entry which occurs when intracellular calcium stores are depleted, and that SOC/CRAC 
currents are important for initiating various types of calcium-dependent processes. Thus, we 
believe that SOC/CRAC calcium channels play an important role in cellular calcium 
homeostasis by, e.g., modulating the supply of calcium to refill intracellular stores when 
depleted. 

The isolated full-length sequence of a representative, first member of the SOC/CRAC 
family, human SOC/CRAC nucleic acid (cDNA), SOC-2/CRAC-1, is represented as the 
nucleic acid of SEQ ID NO:27. This nucleic acid sequence codes for the SOC-2/CRAC-1 
polypeptide with the predicted amino acid sequence disclosed herein as SEQ ID NO:28. A 
homologous mouse cDNA sequence (>90% identity to the human at the nucleotide level) is 
represented as the nucleic acid of SEQ ID NO:7, and codes for a unique fragment of a mouse 
SOC-'2/CRAC-l pol3^eptide having the predicted, partial amino acid sequence represented as 
SEQ ID NO:8. Analysis of the SOC-2/CRAC-1 partial sequence by comparison to nucleic 
acid and protein databases show that SOC-2/CRAC-1 shares a limited homology to mouse 
MLSN-l (SOC-1, SEQ ID NOs: 9 and 10). Limited homology is also shared between SOC- 
2/CRAC-l and three C Elegans polypeptides (SEQ ID NOs: 13, 14, and 15). We further 
believe that SOC-2/CRAC-1 plays a role in the regulation of cellular Ca^^ fluxing and, in 
particular, lymphocyte Ca^^ fluxing. 

A second member of the human SOC/CRAC family of calcium channels, SOC- 
3/CRAC-2, is represented as the nucleic acid of SEQ ID NO:29, and codes for the human 
SOC-3/CRAC-2 polypeptide having the predicted amino acid sequence represented as SEQ 
ID NO:30 (this molecule may also be referred to as CECH2). SOC-3/CRAC-2 is 
predominantly expressed in human hematopoietic cells (including peripheral blood 
lymphocytes, liver, bone marrow, spleen, thymus, lymph nodes, heart, and kidney. 
Expression can also be detected (at lesser levels) in brain, skeletal muscle colon, small 
intestine, placenta, lung, and cells (cell lines) such as HL-60, HeLa, K562, MOLT-4, SW-480, 
A459, and G361. 
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A third member of the human SOC/CRAC family of calcium channels, SOC- 
4/CRAC-3, is represented as the nucleic acid of SEQ ID NO:31, and codes for the human 
SOC-4/CRAC>3 polypeptide having the predicted amino acid sequence represented as SEQ 
ID NO:32 (this molecule may also be referred to as CECH6). It specifically expressed in the 
prostate gland/cells. 

As used herein, a SOC/CRAC calcium channel nucleic acid (also referred to herein as 
a "SOC/CRAC nucleic acid" refers to a nucleic acid molecule which: (1) hybridizes under 
stringent conditions to one or more of the nucleic acids having the sequences of SEQ. ID 
NOS. 7, 27, 29, and/or 31 (sequences of the mouse and human SOC-2/CRAC-1, human SOC- 
3/CRAC-2, and human SOC-4/CRAC-3 nucleic acids), and (2) codes for a SOC-2/CRAC-1, a 
SOC-3/CRAC-2 or a SOC-4/CRAC-3 calcium channel polypeptide, respectively, or unique 
fragments of said SOC-2/CRAC-1, SOC-3/CRAC-2, or SOC-4/CRAC-3 polypeptide. 

As used herein, a SOC/CRAC calcium channel polypeptide (also referred to herein as 
a "SOC/CRAC polypeptide") refers to a polypeptide that is coded for by a SOC-2/CRAC-1 , a 
SOC-3/CRAC-2, and/or a SOC-4/CRAC-3 nucleic acid. Preferably, the above-identified 
SOC/CRAC polypeptides mediate transport of calcium into and out of a cell. 

SOC/CRAC polypeptides also are usefiil as immunogenic molecules for the 
generation of binding polypeptides (e.g., antibodies) which bind selectively to SOC/CRAC 
(e.g., SOC-2/CRAC-1, SOC-3/CRAC-2, and/or SOC-4/CRAC-3) polypeptides. Such 
antibodies can be used in diagnostic assays to identify and/or quantify the presence of a 
SOC/CRAC polypeptide in a sample, such as a biological fluid or biopsy sample. 
SOC/CRAC polypeptides further embrace functionally equivalent fragments, variants, and 
analogs of the preferred SOC/CRAC polypeptides, provided that the fragments, variants, and 
analogs also are useful in mediating calcium transport into and out of intracellular calcium 
stores. 

As used herein, "SOC/CRAC calcium channel activity" refers to Ca^"^ transport ("Ca^"^ 
fluxing") across the plasma membrane that is mediated by a SOC/CRAC calcium channel 
polypeptide. The SOC/CRAC calcium channel polypeptide typically has one or more of the 
following properties: high selectivity, a unitary conductance below the detection level of the 
patch clamp method,and are subject to inhibition by high intracellular calcium levels. Such 
activity can be easily detected using standard methodology well known in the art. See, e.g., 
the Examples and Neher, E., "Ion chaimels for communication between and within cells". 
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Science. 1992 : 256:498-502; and Hoth. M., and Penner, R., "Depletion of intracellular 
calcium stores activates a calcium current in mast cells" Nature, 1992: 355 (6358):353-6. 

According to one aspect of the invention, isolated nucleic acid molecules which code 
for one or more member{s) of the SOC/CRAC family of calcium channel polypeptides are 
provided. The isolated nucleic acid molecules are selected from the following groups: 

(a) nucleic acid molecules which hybridize under stringent conditions to one or more 
nucleic acid molecules selected from the group consisting of SEQ ID NO:l, SEQ ID NO:3, 
SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID 
NO:29, and SEQ ID NO:3 1 , and which code for a SOC/CRAC polypeptide; 

(b) deletions, additions and substitutions of (a) which code for a respective 
SOC/CRAC polypeptide; 

(c) nucleic acid molecules that differ from the nucleic acid molecules of (a) or (b) in 
codon sequence due to the degeneracy of the genetic code, and 

(d) complements of (a), (b) or (c). 

In certain embodiments, the isolated nucleic acid molecule comprises one or more of 
nucleotides 1-1212 of SEQ IDNO:l; nucleotides 1-739 of SEQ ID NO:3; nucleotides 1-1579 
of SEQ ID NO:5; nucleotides 1-5117 of SEQ ID NO:23; the mouse homolog for SOC- 
2/CRAC-l corresponding to SEQ ID NO:7; nucleotides 1-2180 of SEQ ID NO:25; 
nucleotides 382-5976 of SEQ ID NO:27; nucleotides 73-3714 of SEQ ID NO:29; and 
nucleotides 23-3434 of SEQ ID NO:3L In yet other embodiments, the isolated nucleic acid 
molecule comprises a molecule which encodes a polypeptide having one or more sequences 
selected from the group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID 
NO:8, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, and SEQ ID NO:32. 

According to yet another aspect of the invention, an isolated nucleic acid molecule is 
provided which is selected from the group consisting of: 

(a) a unique fragment of a nucleic acid molecule selected from the group consisting of 
SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:23, SEQ ID NO:25, 
SEQ ID NO:27, SEQ ID NO:29, and SEQ ID NO:31, (of sufficient length to represent a 
sequence unique within the human genome); and (b) complements of (a), provided that the 
unique fragment includes a sequence of contiguous nucleotides which is not identical to a 
sequence in the prior art as represented by the sequence group consisting of: (1) sequences 
having the SEQ ID NOs or GenBank accession numbers of Table I, (2) complements of (1), 
and (3) fragments of (1) and (2). 
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In some embodiments, the sequence of contiguous nucleotides is selected from the 
group consisting of (1) at least two contiguous nucleotides nonidentical to the sequence 
group, (2) at least three contiguous nucleotides nonidentical to the sequence group, (3) at least 
four contiguous nucleotides nonidentical to the sequence group, (4) at least five contiguous 
nucleotides nonidentical to the sequence group, (5) at least six contiguous nucleotides 
nonidentical to the sequence group, (6) at least seven contiguous nucleotides nonidentical to 
the sequence group. 

In other embodiments, the imique fragment has a size selected from the group 
consisting of at least: 8 nucleotides, 10 nucleotides, 12 nucleotides, 14 nucleotides, 16 
nucleotides, 18 nucleotides, 20, nucleotides, 22 nucleotides, 24 nucleotides, 26 nucleotides, 
28 nucleotides, 30 nucleotides, 40 nucleotides, 50 nucleotides, 75 nucleotides, 100 
nucleotides, 200 nucleotides, 1000 nucleotides and every integer length therebetween. 

According to another aspect of the invention, expression vectors and host cells 
containing (e.g., transformed or transfected with) expression vectors comprising the nucleic 
acid molecules disclosed herein operably linked to a promoter are provided. In certain 
preferred embodiments, the host cells are eukaryotic cells. 

The isolated nucleic acid molecules disclosed herein have various utilities, including 
their use as probes and primers to identify additional members of the SOC/CRAC family of 
calciimi chaimels, as diagnostic reagents for identifying the presence of SOC/CRAC 
polypeptides in biological or other samples, and as agents for generating SOC/CRAC binding 
polypeptides (e.g., antibodies) that can be used as reagents in diagnostic and therapeutic 
assays to identify the presence, absence, and/or amounts of a SOC/CRAC nucleic acid or 
polypeptide in a biological or other sample. 

As used herein with respect to nucleic acids, the term "isolated" means: (i) amplified 
in vitro by, for example, polymerase chain reaction (PGR); (ii) recombinantly produced by 
cloning; (iii) purified, as by cleavage and gel separation; or (iv) synthesized by, for example, 
chemical synthesis. An isolated nucleic acid is one which is readily manipulatable by 
recombinant DNA techniques well known in the art. Thus, a nucleotide sequence contained 
in a vector in which 5' and 3' restriction sites are known or for which polymerase chain 
reaction (PCR) primer sequences have been disclosed is considered isolated but a nucleic acid 
sequence existing in its native state in its natural host is not. An isolated nucleic acid may be 
substantially purified, but need not be. For example, a nucleic acid that is isolated within a 
cloning or expression vector is not pure in that it may comprise only a tiny percentage of the 
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material in the cell in which it resides. Such a nucleic acid is isolated, however, as the term is 
used herein because it is readily manipulatable by standard techniques known to those of 
ordinary skill in the art. 

As used herein with respect to polypeptides (discussed below), the term "isolated" 
means separated from its native environment in sufficiently pure form so that it can be 
manipulated or used for any one of the purposes of the invention. Thus, isolated means 
sufficiently pure to be used (i) to raise and/or isolate antibodies, (ii) as a reagent in an assay, 
or (iii) for sequencing, etc. 

Homologs and alleles of the SOC/CRAC nucleic acids of the invention can be 
identified by conventional techniques. Thus, an aspect of the invention is those nucleic acid 
sequences which code for SOC/CRAC polypeptides and which hybridize to a nucleic acid 
molecule selected from a group consisting of the nucleic acid of SEQ ID NO:l, the nucleic 
acid of SEQ ID NO:3, the nucleic acid of SEQ ID N0:5, the nucleic acid of SEQ ID NO:7, 
the nucleic acid of SEQ ID NO:23, the nucleic acid of SEQ ID NO:25, the nucleic acid of 
SEQ ID NO:27, the nucleic acid of SEQ ID NO:29, and the nucleic acid of SEQ ID NO:31, 
under stringent conditions. The term "stringent conditions" as used herein refers to 
parameters with which the art is familiar. Nucleic acid hybridization parameters may be 
found in references which compile such methods, e.g. Molecular Cloning: A Laboratory 
Manual^ J. Sambrook, et al., eds.. Second Edition, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, New York, 1989, or Current Protocols in Molecular Biology, F.M. Ausubel, 
et al., eds., John Wiley & Sons, Inc., New York. More specifically, stringent conditions, as 
used herein, refers, for example, to hybridization at 65°C in hybridization buffer (3.5 x SSC, 
0.02% Ficoll, 0.02% polyvinyl pyrolidone, 0.02% Bovine Serum Albumin, 2.5mM 
NaH2P04(pH7), 0.5% SDS, 2mM EDTA). SSC is 0.1 5M sodium chIoride/0.15M sodium 
citrate, pH7; SDS is sodium dodecyl sulphate; and EDTA is ethylenediaminetetracetic acid. 
After hybridization, the membrane upon which the DNA is transferred is washed at 2 x SSC 
at room temperature and then at 0.1 x SSC/0, 1 x SDS at temperatures up to 68°C. 

There are other conditions, reagents, and so forth which can be used, and would result 
in a similar degree of stringency. The skilled artisan will be familiar with such conditions, 
and thus they are not given here. It will be understood, however, that the skilled artisan will 
be able to manipulate the conditions in a manner to permit the clear identification of 
homologs and alleles of the SOC/CRAC nucleic acids of the invention. The skilled artisan 
also is familiar with the methodology for screening cells, and libraries for expression of such 
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molecules which then are routinely isolated, followed by isolation of the pertinent nucleic 
acid molecule and sequencing. 

In general homologs and alleles typically will share at least 40% nucleotide identity 
and/or at least 50% amino acid identity to SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ 
ID NO:7, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, and/or SEQ ID 
NO:31, and SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:24, 
SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, and/or SEQ ID NO:32, respectively. In 
some instances sequences will share at least 50% nucleotide identity and/or at least 65% 
amino acid identity and in still other instances sequences will share at least 60% nucleotide 
identity and/or at least 75% amino acid identity. The homology can be calculated using 
various, publicly available software tools developed by NCBI (Bethesda, Maryland) that can 
be obtained through the internet (fltp:/ncbi.nlm.nih.gov/pub/). Exempleiry tools include the 
BLAST system available at http://www.ncbi.nlm.nih.gov, Pairwise and ClustalW alignments 
(BLOSIJM30 matrix setting) as well as Kyte-Doolittle hydropathic analysis can be obtained 
using the MacVetor sequence analysis software (Oxford Molecular Group). Watson-Crick 
complements of the foregoing nucleic acids also are embraced by the invention. 

In screening for SOC/CRAC related genes, such as homologs and alleles of SOC- 
2/CRAC-l and/or SOC-3/CRAC-2, a Southern blot may be performed using the foregoing 
conditions, together with a radioactive probe. After washing the membrane to which the 
DNA is finall}' transferred, the membrane can be placed against X-ray film or a 
phosphoimager plate to detect the radioactive signal. 

Given that the expression of the SOC/CRAC gene is prominent in certain human 
tissues (e.g., SOC-2/CRAC-1: lymphoid tissue/heart, SOC-3/CRAC-2: kidney/colon, SOC- 
4/CRAC-3: prostate), and given the teachings herein of partial human SOC/CRAC cDNA 
clones, full-length and other mammalian sequences corresponding to the human SOC/CRAC 
partial nucleic acid sequences can be isolated from, for example, a cDNA library prepared 
from one or more of the tissues in which SOC-2/CRAC-1 expression is prominent, SOC- 
3/CRAC-2 is prominent, and/or SOC-4/CRAC-3 expression is prominent, using standard 
colony hybridization techniques. 

The invention also includes degenerate nucleic acids which include alternative codons 
to those present in the native materials. For example, serine residues are encoded by the 
codons TCA, AGT, TCC, TCG, TCT and AGC. Each of the six codons is equivalent for the 
purposes of encoding a serine residue. Thus, it will be apparent to one of ordinary skill in the 
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art that any of the serine-encoding nucleotide triplets may be employed to direct the protein 
synthesis apparatus, in vitro or in vivo, to incorporate a serine residue into an elongating 
SOC/CRAC polypeptide. Similarly, nucleotide sequence triplets which encode other amino 
acid residues include, but are not limited to: CCA, CCC, CCG and CCT (proline codons); 
5 CGA, CGC, CGG, CGT, AGA and AGG (arginine codons); ACA, ACC, AGO and ACT 
(threonine codons); AAC £md AAT (asparagine codons); and ATA, ATC and ATT (isoleucine 
codons). Other amino acid residues may be encoded similarly by multiple nucleotide 
sequences. Thus, the invention embraces degenerate nucleic acids that differ from the 
biologically isolated nucleic acids in codon sequence due to the degeneracy of the genetic 
10 code. 

The invention also provides isolated unique fragments of an isolated nucleic acid 
molecule selected from the group consisting of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, 
SEQ ID NO:7, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, and SEQ ID 
NO:31. A unique fragment is one that is a 'signature' for the larger nucleic acid. For 

15 example, the unique fragment is long enough to assure that its precise sequence is not found 
in molecules within the human genome outside of the SOC/CRAC nucleic acids defined 
above (and human alleles). Those of ordinary skill in the art may apply no more than routine 
procedures to determine if a fragment is unique within the human genome. 

Unique fragments, however, exclude fragments completely composed of the 

20 nucleotide sequences of any of GenBank accession numbers and SEQ ID NOs listed in Table 
I (SEQ ID NO:9, AB001535, AI226731, H18835, AA4I9592, AA261842, AA419407, 
AI098310, AA592910, D86107, AF071787, Z77132, Z83117, Z68333, AA708532, 
AA551759, AA932133, R47363, N31660, AC005538, AA654650, AA370110, AA313170, 
AA493512, AI670079, AI671853, AC005538, AA654650, AA370110, AA313170, 

25 AA493512, AI670079, AI671853), or other previously published sequences as of the filing 
date of this application. 

A fragment which is completely composed of the sequence described in the foregoing 
GenBank deposits and SEQ ID NO:9, is one which does not include any of the nucleotides 
unique to the sequences of the invention. Thus, a unique fragment must contain a nucleotide 

30 sequence other than the exact sequence of those in GenBank or fragments thereof. The 
difiTerence may be an addition, deletion or substitution with respect to the GenBank sequence 
or it may be a sequence wholly separate from the GenBank sequence. 
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Unique fragments can be used as probes in Southern and Northern blot assays to 
identify such nucleic acids, or can be used in amplification assays such as those employing 
PGR. As known to those skilled in the art, large probes such as 200, 250, 300 or more 
nucleotides are preferred for certain uses such as Southern and Northern blots, while smaller 
fragments will be preferred for uses such as PGR. Unique fragments also can be used to 
produce fusion proteins for generating antibodies or determining binding of the polypeptide 
fragments, as demonstrated in the Examples, or for generating immunoassay components. 
Likewise, unique fragments can be employed to produce nonftised fragments of the 
SOG/GRAG polypeptides, useful, for example, in the preparation of antibodies, 
inmiunoassays or therapeutic applications. Unique fragments further can be used as antisense 
molecules to inhibit the expression of SOG/GRAG nucleic acids and polypeptides, 
respectively. 

As will be recognized by those skilled in the art, the size of the unique fragment will 
depend upon its conservancy in the genetic code. Thus, some regions of SEQ ID NO:l, SEQ 
ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, 
SEQ ID NO:29, and SEQ ID NO:31, and complements thereof, will require longer segments 
to be unique while others will require only short segments, typically between 12 and 32 
nucleotides long (e.g. 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 
3 1 and 32 bases) or more, up to the entire length of the disclosed sequence. As mentioned 
above, this disclosure intends to embrace each and every fragment of each sequence, 
beginning at the first nucleotide, the second nucleotide and so on, up to 8 nucleotides short of 
the end, and ending anywhere from nucleotide number 8, 9, 10 and so on for each sequence, 
up to the very last nucleotide, (provided the sequence is unique as described above). Virtually 
any segment of the region of SEQ ID NO:l beginning at nucleotide I and ending at 
nucleotide 1212, or SEQ ID NO:3 beginning at nucleotide 1 and ending at nucleotide 739, or 
SEQ ID NO:5 beginning at nucleotide 1 and ending at nucleotide 1579, or SEQ ID NO:7 
begiiming at nucleotide I and ending at nucleotide 3532, or SEQ ID NO:23 beginning at 
nucleotide 1 and ending at nucleotide 5117, SEQ ID NO:25 beginning at nucleotide 1 and 
ending at nucleotide 2180, SEQ ID NO:27 beginning at nucleotide 1 and ending at nucleotide 
7419, or SEQ ID NO:29 beginning at nucleotide 1 and ending at nucleotide 4061, or SEQ ID 
NO:31 beginning at nucleotide 1 and ending at nucleotide 4646, or complements thereof, that 
is 20 or more nucleotides in length will be unique. Those skilled in the art are well versed in 
methods for selecting such sequences, typically on the basis of the ability of the tmique 
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fragment to selectively distinguish the sequence of interest from other sequences in the human 
genome of the fragment to those on known databases typically is all that is necessary, 
although in vitro confirmatory hybridization and sequencing analysis may be performed. 

As mentioned above, the invention embraces antisense oligonucleotides that 
selectively bind to a nucleic acid molecule encoding a SOC/CRAC polypeptide, to decrease 
SOC/CRAC calcium channel activity. When using antisense preparations of the invention, 
slow intravenous administration is preferred. 

As used herein, the term "antisense oligonucleotide" or "antisense" describes an 
oligonucleotide that is an oligoribonucleotide, oligodeoxy ribonucleotide, modified 
oligoribonucleotide, or modified oligodeoxyribonucleotide which hybridizes under 
physiological conditions to DNA comprising a particular gene or to an mRNA transcript of 
that gene and, thereby, inhibits the transcription of that gene and/or the translation of that 
mRNA. The antisense molecules are designed so as to interfere with transcription or 
translation of a target gene upon hybridization with the target gene or transcript. Those 
skilled in the art will recognize that the exact length of the antisense oligonucleotide and its 
degree of complementarity with its target will depend upon the specific target selected, 
including the sequence of the target and the particular bases which comprise that sequence. It 
is preferred that the antisense oligonucleotide be constructed and arranged so as to bind 
selectively with the target under physiological conditions, i.e., to hybridize substantially more 
to the target sequence than to any other sequence in the target cell under- physiological 
conditions. Based upon SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID 
NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, and SEQ ID NO:31, or upon allelic 
or homologous genomic and/or cDNA sequences, one of skill in the art can easily choose and 
synthesize any of a number of appropriate antisense molecules for use in accordance wdth the 
present invention. In order to be sufficiently selective and potent for inhibition, such 
antisense oligonucleotides should comprise at least 10 and, more preferably, at least 15 
consecutive bases which are complementary to the target, although in certain cases modified 
oligonucleotides as short as 7 bases in length have been used successfully as antisense 
oligonucleotides (Wagner et al., Nai. Med 1(11): 1 116-1 1 18, 1995). Most preferably, the 
antisense oligonucleotides comprise a complementary sequence of 20-30 bases. Although 
oligonucleotides may be chosen which are antisense to any region of the gene or mRNA 
transcripts, in preferred embodiments the antisense oligonucleotides correspond to N-terminal 
or 5' upstream sites such as translation initiation, transcription initiation or promoter sites. In 
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addition, 3 '-untranslated regions may be targeted by antisense oligonucleotides. Targeting to 
mRNA splicing sites has also been used in the art but may be less preferred if alternative 
mRNA splicing occurs. In addition, the antisense is targeted, preferably, to sites in which 
mRNA secondary structure is not expected (see, e.g., Sainio et al.. Cell Mol Neurobiol, 
14(5):439-457, 1 994) and at which proteins are not expected to bind. Finally, although, SEQ 
ID No. l discloses a cDNA sequence, one of ordinary skill in the art may easily derive the 
genomic DNA corresponding to this sequence. Thus, the present invention also provides for 
antisense oligonucleotides which are complementary to the genomic DNA corresponding to 
SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5,SEQ ID NO:7, SEQ ID NO:23, SEQ ID NO:25, 
SEQ ID NO:27, SEQ ID NO:29, and SEQ ID NO:31. Similarly, antisense to allelic or 
homologous SOC/CRAC cDNAs and genomic DNAs are enabled without undue 
experimentation. 

In one set of embodiments, the antisense oligonucleotides of the invention may be 
composed of "natural'" deoxyribonucleotides, ribonucleotides, or any combination thereof. 
That is, the 5' end of one native nucleotide and the V end of another native nucleotide may be 
covalently linked, as in natural systems, via a phosphodiester intemucleoside linkage. These 
oligonucleotides may be prepared by art recognized methods which may be carried out 
manually or by an automated synthesizer. They also may be produced recombinantly by 
vectors. 

In preferred embodiments, however, the antisense oligonucleotides of the invention 
also may include "modified" oligonucleotides. That is, the oligonucleotides may be modified 
in a number of ways which do not prevent them from hybridizing to their target but which 
enhance their stability or targeting or which otherwise enhance their therapeutic effectiveness. 

The term "modified oligonucleotide" as used herein describes an oligonucleotide in 
which (1) at least two of its nucleotides are covalently linked via a synthetic intemucleoside 
linkage (i.e., a linkage other than a phosphodiester linkage between the 5' end of one 
nucleotide and the 3' end of another nucleotide) and/or (2) a chemical group not normally 
associated with nucleic acids has been covalently attached to the oligonucleotide. Preferred 
synthetic intemucleoside linkages are phosphorothioates, alkylphosphonates, 
phosphorodithioates, phosphate esters, alkylphosphonothioates, phosphoramidates, 
carbamates, carbonates, phosphate triesters, acetamidates, carboxymethyl esters and peptides. 

The term "modified oligonucleotide" also encompasses oligonucleotides with a 
covalently modified base and/or sugar. For example, modified oligonucleotides include 
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oiigonucieotides having backbone sugars which are covalently attached to low molecular 
weight organic groups other than a hydroxyl group at the 3' position and other than a 
phosphate group at the 5' position. Thus modified oligonucleotides may include a 2'-0- 
alkylated ribose group. In addition, modified oligonucleotides may include sugars such as 
arabinose instead of ribose. The present invention, thus, contemplates pharmaceutical 
preparations containing modified antisense molecules that are complementary to and 
hybridizable with, under physiological conditions, nucleic acids encoding SOC/CRAC 
polypeptides, together with pharmaceutically acceptable carriers. Antisense oligonucleotides 
may be administered as part of a pharmaceutical composition. Such a pharmaceutical 
composition may include the antisense oligonucleotides in combination with any standard 
physiologically and/or pharmaceutically acceptable carriers which are known in the art. The 
compositions should be sterile and contain a therapeutically effective amount of the antisense 
oligonucleotides in a unit of weight or volume suitable for administration to a patient. The 
term "pharmaceutically acceptable" means a non-toxic material that does not interfere with 
the effectiveness of the biological activity of the active ingredients. The term 
"physiologically acceptable" refers to a non-toxic material that is compatible with a biological 
system such as a cell, cell culture, tissue, or organism. The characteristics of the carrier will 
depend on the route of administration. Physiologically and pharmaceutically acceptable 
carriers include diluents, fillers, salts, buffers, stabilizers, solubilizers, and other materials 
which are well known in the art. 

The invention also involves expression vectors coding for SOC/CRAC proteins and 
fragments and variants thereof and host cells containing those expression vectors. Virtually 
any cells, prokaryotic or eukaryotic, which can be transformed with heterologous DNA or 
RNA and which can be grown or maint£iined in culture, may be used in the practice of the 
invention. Examples include bacterial cells such as E.coli and eukaryotic cells such as mouse, 
hamster, pig, goat, primate, yeast, xenopous, etc. They may be of a wide variety of tissue 
types, including mast cells, fibroblasts, oocytes and lymphocytes, and they may be primary 
cells or cell lines. Specific examples include CHO cells and COS cells. Cell-free 
transcription systems also may be used in lieu of cells. 

As used herein, a "vector" may be any of a number of nucleic acids into which a 
desired sequence may be inserted by restriction and ligation for transport between different 
genetic environments or for expression in a host cell. Vectors are typically composed of 
DNA although RNA vectors are also available. Vectors include, but are not limited to, 
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plasmids, phagemids and virus genomes. A cloning vector is one which is able to replicate in 
a host cell, and which is further characterized by one or more endonuclease restriction sites at 
which the vector may be cut in a determinable fashion and into which a desired DNA 
sequence may be ligated such that the new recombinant vector retains its ability to replicate in 
the host cell. In the case of plasmids, replication of the desired sequence may occur many 
times as the plasmid increases in copy number within the host bacterium or just a single time 
per host before the host reproduces by mitosis. In the case of phage, replication may occur 
actively during a lytic phase or passively during a lysogenic phase. An expression vector is 
one into which a desired DNA sequence may be inserted by restriction and ligation such that 
it is operably joined to regulatory sequences and may be expressed as an RNA transcript. 
Vectors may further contain one or more marker sequences suitable for use in the 
identification of cells which have or have not been transformed or transfected with the vector. 
Markers include, for example, genes encoding proteins which increase or decrease either 
resistance or sensitivity to antibiotics or other compounds, genes which encode enzymes 
whose activities are detectable by standard assays known in the art (e.g., p-galactosidase or 
alkaline phosphatase), and genes which visibly affect the phenotype of transformed or 
transfected cells, hosts, colonies or plaques (e.g., green fluorescent protein). Preferred vectors 
are those capable of autonomous replication and expression of the structural gene products 
present in the DNA segments to which they are operably joined. 

As used herein, a coding sequence and regulatory sequences are said to be "operably" 
joined when they are covalently linked in such a way as to place the expression or 
transcription of the coding sequence imder the influence or control of the regulatory 
sequences. If it is desired that the coding sequences be translated into a functional protein, 
two DNA sequences are said to be operably joined if induction of a promoter in the 5' 
regulatory sequences results in the transcription of the coding sequence and if the nature of 
the linkage between the two DNA sequences does not (1) result in the introduction of a frame- 
shift mutation, (2) interfere with the ability of the promoter region to direct the transcription 
of the coding sequences, or (3) interfere with the ability of the corresponding RNA transcript 
to be translated into a protein. Thus, a promoter region would be operably joined to a coding 
sequence if the promoter region were capable of effecting transcription of that DNA sequence 
such that the resulting transcript might be translated into the desired protein or polypeptide. 

The precise nature of the regulatory sequences needed for gene expression may vary 
between species or cell types, but shall in general include, as necessary, 5' non-transcribed 
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and 5' non-translated sequences involved with the initiation of transcription and translation 
respectively, such as a TATA box, capping sequence, CAAT sequence, and the like. 
Especially, such 5' non-transcribed regulatory sequences will include a promoter region 
which includes a promoter sequence for transcriptional control of the operably joined gene. 
Regulatory sequences may also include enhancer sequences or upstream activator sequences 
as desired. The vectors of the invention may optionally include 5' leader or signal sequences. 
The choice and design of an appropriate vector is within the ability and discretion of one of 
ordinary skill in the art. 

According to yet another aspect of the invention, isolated SOC/CRAC polypeptides 
are provided. Preferably, the isolated SOC/CRAC polypeptides are encoded by the isolated 
SOC/CRAC nucleic acid molecules disclosed herein. More preferably, the isolated 
SOC/CRAC polypeptides of the invention are encoded by the nucleic acid molecules having 
SEQ ID Nos. L 3, 5, 7, 23, 25, 27, 29, and 31. In yet other embodiments, the isolated 
SOC/CRAC polypeptides of the invention have an amino acid sequence selected from the 
group consisting of SEQ ID Nos. 2, 4, 6, 8, 24, 26, 28, 30 and 32. Preferably, the isolated 
SOC/CRAC polypeptides are of sufficient length to represent a sequence unique within the 
human genome. Thus, the preferred embodiments include a sequence of contiguous amino 
acids which is not identical to a prior art sequence as represented by the sequence group 
consisting of the contiguous amino acids identified in Table II (SEQ ID NO: 10, SEQ ID 
NO:13, SEQ ID NO:I4, SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:19 and GenBank Acc. 
Nos. AB001535, AA592910, D86107, AF071787, Z77132, Z83117, Z68333, AA708532, 
AA551759, AA932133, R47363, N31660, NP003298, CAB00861, NP002411, CAA92726, 
CAB05572). 

In certain embodiments, the isolated SOC/CRAC polypeptides are inununogenic and 
can be used to generate binding polypeptides (e.g., antibodies) for use in diagnostic and 
therapeutic applications. Such binding polypeptides also are useful for detecting the presence, 
absence, and/or amounts of a SOC/CRAC nucleic acid or polypeptide in a sample such as a 
biological fluid or biopsy sample. Preferably, the SOC/CRAC polypeptides that are useful for 
generating binding polypeptides are unique polypeptides and, therefore, binding of the 
antibody to a SOC/CRAC polypeptide in a sample is selective for the SOC/CRAC 
polypeptide. 

Expression vectors containing all the necessary elements for expression are 
conunercially available and known to those skilled in the art. See, e.g., Sambrook et al.. 
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Molecular Cloning: A Laboratory Manual^ Second Edition, Cold Spring Harbor Laboratory 
Press, 1989. Cells are genetically engineered by the introduction into the cells of 
heterologous DNA (RNA) encoding a SOC/CRAC polypeptide or firagment or variant thereof. 
The heterologous DNA (RNA) is placed under operable control of transcriptional elements to 
permit the expression of the heterologous DNA in the host cell. 

Preferred systems for mRNA expression in mammalian cells are those such as 
pRc/CMV (available from Invitrogen, Carlsbad, CA) that contain a selectable marker such as 
a gene that confers G418 resistance (which facilitates the selection of stably transfected cell 
lines) and the human cytomegalovirus (CMV) enhancer-promoter sequences. Additionally, 
suitable for expression in primate or canine cell lines is the pCEP4 vector (Invitrogen, 
Carlsbad, CA), which contains an Epstein Barr virus (EBV) origin of replication, facilitating 
the maintenance of plasmid as a multicopy extrachromosomal element. Another expression 
vector is the pEF-BOS plasmid containing the promoter of polypeptide Elongation Factor la, 
which stimulates efficiently transcription in vitro. The plasmid is described by Mishizuma 
and Nagata {Nuc. Acids Res. 18:5322, 1990), and its use in transfection experiments is 
disclosed by, for example, Demoulin (MoL Cell. Biol. 16:4710-4716, 1996). Still another 
preferred expression vector is an adenovirus, described by Stratford-Perricaudet, which is 
defective for El and E3 proteins (J. Clin. Invest. 90:626-630, 1992). The use of the 
adenovirus as an Adeno.PlA recombinant is disclosed by Wamier et al., in intradermal 
injection in mice for immimization against PI A (Int. J. Cancer, 67:303-310, 1996). 

The invention also embraces so-called expression kits, which allow the artisan to 
prepare a desired expression vector or vectors. Such expression kits include at least separate 
portions of each of the previously discussed coding sequences. Other components may be 
added, as desired, as long as the previously mentioned sequences, which are required, are 
included. 

It will also be recognized that the invention embraces the use of the above described, 
SOC/CRAC cDNA sequence containing expression vectors, to transfect host cells and cell 
lines, by these prokaryotic (e.g., E. coli \ or eukaryotic (e.g., CHO cells, COS cells, yeast 
expression systems and recombinant baculovirus expression in insect cells). Especially useful 
are mammalian cells such as mouse, hamster, pig, goat, primate, etc. They may be of a wide 
variety of tissue types, and include primary cells and cell lines. Specific examples include 
dendritic cells, U293 cells, peripheral blood leukocytes, bone marrow stem cells and 
embryonic stem cells. The invention also permits the construction of SOC/CRAC gene 
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"knock-outs" in cells and in animals, providing materials for studying certain aspects of 
SOC/CRAC calcium channel activity. 

The invention also provides isolated polypeptides (including whole proteins and 
partial proteins), encoded by the foregoing SOC/CRAC nucleic acids, and include the 
polypeptides of SEQ ID NO:2, 4, 6, 8, 24, 26, 28, 30, 32, and unique fragments thereof. Such 
polypeptides are useful, for example, to regulate calcium transport-mediated cell growth, 
differentiation and proliferation, to generate antibodies, as components of immimoassays, etc. 
Polypeptides can be isolated from biological samples including tissue or cell homogenates, 
and can also be expressed recombinantly in a variety of prokaryotic and eukaryotic expression 
systems by constructing an expression vector appropriate to the expression system, 
introducing the expression vector into the expression system, and isolating the recombinantly 
expressed protein. Short polypeptides, including antigenic peptides (such as are presented by 
MHC molecules on the surface of a cell for immune recognition) also can be synthesized 
chemically using well-established methods of peptide synthesis. 

A unique fragment of a SOC/CRAC polypeptide, in general, has the features and 
characteristics of unique fragments as discussed above in connection with nucleic acids. As 
will be recognized by those skilled in the art, the size of the imique fragment will depend 
upon factors such as whether the fragment constitutes a portion of a conserved protein 
domain. Thus, some regions of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, 
SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, and/or SEQ ID NO:32, will 
require longer segments to be unique while others will require only short segments, typically 
between 5 and 12 amino acids (e.g. 5, 6, 7, 8, 9, 10, 11 and 12 amino acids long or more, 
including each integer up to the frill length, > 1,000 amino acids long). Virtually any segment 
of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:24, SEQ ID 
NO:26, SEQ ID NO:28, SEQ ID NO:30, and/or SEQ ID NO:32, excluding the ones that share 
identity with it (the polypeptides identified in Table II - SEQ ID NO: 10, SEQ ID NO: 13, SEQ 
ID NO: 14, SEQ ID NO: 15, SEQ ID NO:17, SEQ ID NO: 19, and GenBank Acc. Nos. 
AB001535, AA592910, D86107, AF071787, Z77132, Z83117, Z68333, AA708532, 
AA551759, AA932133, R47363, N31660, NP003298, CAB00861, NP002411, CAA92726, 
CAB05572) that is 9 or more amino acids in length will be unique. 

Unique fragments of a polypeptide preferably are those fragments which retain a 
distinct frmctional capability of the polypeptide. Functional capabilities which can be retained 
in a unique fragment of a polypeptide include Ca^* fluxing, high selectivity, a imitary 
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conductance below the detection level of the patch clamp method, and/or £md are subject to 
inhibition by high intracellular calcium levels. 

One important aspect of a unique fragment is its ability to act as a signature for 
identifying the polypeptide. Optionally, another aspect of a unique fi^gment is its ability to 
provide an immune response in an animal. Those skilled in the art are well versed in methods 
for selecting unique amino acid sequences, typically on the basis of the ability of the unique 
fragment to selectively distinguish the sequence of interest from non-family members. A 
comparison of the sequence of the fragment to those on known databases typically is all that 
is necessary. 

The invention embraces variants of the SOC/CRAC polypeptides described above. As 
used herein, a "variant" of a SOC/CRAC polypeptide is a polypeptide which contains one or 
more modifications to the primary amino acid sequence of a SOC/CRAC polypeptide. 
Modifications which create a SOC/CRAC poljqjeptide variant are typically made to the 
nucleic acid which encodes the SOC/CRAC polypeptide, and can include deletions, point 
mutations, truncations, amino acid substitutions and addition of amino acids or non-amino 
acid moieties to: 1) reduce or eliminate a calcium channel activity of a SOC/CRAC 
polypeptide; 2) enhance a property of a SOC/CRAC polypeptide, such as protein stability in 
an expression system or the stability of protein-protein binding; 3) provide a novel activity or 
property to a SOC/CRAC polypeptide, such as addition of an antigenic epitope or addition of 
a detectable moiety; or 4) to provide equivalent or better binding to a SOC/CRAC polypeptide 
receptor or other molecule. Alternatively, modifications can be made directly to the 
polypeptide, such as by cleavage, addition of a linker molecule, addition of a detectable 
moiety, such as biotin, addition of a fatty acid, and the like. Modifications also embrace 
fusion proteins comprising all or part of the SOC/CRAC amino acid sequence. One of skill in 
the art will be familiar with methods for predicting the effect on protein conformation of a 
change in protein sequence, and can thus "design" a variant SOC/CRAC polypeptide 
according to known methods. One example of such a method is described by Dahiyat and 
Mayo in Science 278:82-87, 1997, whereby proteins can be designed de novo. The method 
can be applied to a known protein to vary only a portion of the polypeptide sequence. By 
applying the computational methods of Dahiyat and Mayo, specific variants of a SOC/CRAC 
calcium channel polypeptide can be proposed and tested to determine whether the variant 
retains a desired conformation. 
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Variants can include SOC/CRAC polypeptides which are modified specifically to alter 
a feature of the polypeptide unrelated to its physiological activity. For example, cysteine 
residues can be substituted or deleted to prevent unwanted disulfide linkages. Similarly, 
certain amino acids can be changed to enhance expression of a SOC/CRAC polypeptide by 
eliminating proteolysis by proteases in an expression system (e.g., dibasic amino acid residues 
in yeast expression systems in which KJBX2 protease activity is present). 

Mutations of a nucleic acid which encodes a SOC/CRAC polypeptide preferably 
preserve the amino acid reading frame of the coding sequence and, preferably, do not create 
regions in the nucleic acid which are likely to hybridize to form secondary structures, such as 
hairpins or loops, which can be deleterious to expression of the variant polypeptide. 

Mutations can be made by selecting an amino acid substitution, or by random 
mutagenesis of a selected site in a nucleic acid which encodes the polypeptide. Variant 
polypeptides are then expressed and tested for one or more activities to determine which 
mutation provides a variant polypeptide with the desired properties. Further mutations can be 
made to variants (or to non-variant SOC/CRAC polypeptides) which are silent as to the amino 
acid sequence of the polypeptide, but which provide preferred codons for translation in a 
particular host. The preferred codons for translation of a nucleic acid in, e.g., E. coli, are well 
known to those of ordinary skill in the art. Still other mutations can be made to the noncoding 
sequences of a SOC/CRAC gene or cDNA clone to enhance expression of the polypeptide. 

The skilled artisan will realize that conservative amino acid substitutions may be made 
in SOC/CRAC polypeptides to provide functionally equivalent variants of the foregoing 
polypeptides, i.e, the variants retain the functional capabilities of the SOC/CRAC 
polypeptides. As used herein, a "conservative amino acid substitution" refers to an amino 
acid substitution which does not alter the relative charge or size characteristics of the protein 
in which the amino acid substitution is made. Variants can be prepared according to methods 
for altering polypeptide sequence known to one of ordinary skill in the art such as are found in 
references which compile such methods, e.g. Molecular Cloning: A Laboratory Manual^ J. 
Sambrook, et ai., eds.. Second Edition, Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, New York, 1989, or Current Protocols in Molecular Biology, F.M. Ausubel, et al., 
eds., John Wiley & Sons, Inc., New York. Exemplary functionally equivalent variants of the 
SOC/CRAC polypeptides include conservative amino acid substitutions of SEQ ID NO:2, 
SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:24, SEQ ID NO:26, SEQ ID 
NO:28, SEQ ID NO:30, and/or SEQ ID NO:32. Conservative substitutions of amino acids 
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include substitutions made amongst amino acids vsdthin the following groups: (a) M, I, L, V; 
(b) F, Y, W; (c) K, R, H; (d) A, G; (e) S, T; (f) Q. N; and (g) E, D. 

Thus functionally equivalent variants of SOC/CRAC polypeptides, i.e., variants of 
SOC/CRAC polypeptides which retain the function of the natural SOC/CRAC polypeptides, 
are contemplated by the invention. Conservative amino-acid substitutions in the amino acid 
sequence of SOC/CRAC polypeptides to produce functionally equivalent variants of 
SOC/CRAC polypeptides typically are made by alteration of a nucleic acid encoding 
SOC/CRAC polypeptides (e.g., SEQ ID NOs:l, 3, 5, 7, 23, 25, 27, 29, 31). Such substitutions 
can be made by a variety of methods known to one of ordinary skill in the art. For example, 
amino acid substitutions may be made by PCR-directed mutation, site-directed mutagenesis 
according to the method of Kunkel (Kunkel, Proc. Nat. Acad. Sci. U.S.A. 82: 488-492, 1985), 
or by chemical synthesis of a gene encoding a SOC/CRAC polypeptide. The activity of 
functionally equivalent fragments of SOC/CRAC polypeptides can be tested by cloning the 
gene encoding the altered SOC/CRAC polypeptide into a bacterial or mammalian expression 
vector, introducing the vector into an appropriate host cell, expressing the altered SOC/CRAC 
polypeptide, and testing for a functional capability of the SOC/CRAC polypeptides as 
disclosed herein (e.g., SOC/CRAC calcium channel activity). 

The invention as described herein has a number of uses, some of which are described 
elsewhere herein. First, the invention permits isolation of SOC/CRAC polypeptides, 
including the isolation of the complete SOC/CRAC polypeptide. A variety of methodologies 
well-known to the skilled practitioner c£in be utilized to obtain isolated SOC/CRAC 
molecules. The polypeptide may be purified from cells which naturally produce the 
polypeptide by chromatographic means or immunological recognition. Alternatively, an 
expression vector may be introduced into cells to cause production of the polypeptide. In 
another method, mRNA transcripts may be microinjected or otherwise introduced into cells to 
cause production of the encoded polypeptide. Translation of SOC/CRAC mRNA in cell-free 
extracts such as the reticulocyte lysate system also may be used to produce SOC/CRAC 
polypeptides. Those skilled in the art also can readily follow known methods for isolating 
SOC/CRAC polypeptides. These include, but are not limited to, immunochromatography, 
HPLC, size-exclusion chromatography, ion-exchange chromatography and immune-aflinity 
chromatography. 

The invention also provides, in certain embodiments, "dominant negative" 
polypeptides derived from SOC/CRAC polypeptides. A dominant negative polypeptide is an 
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inactive variant of a protein, which, by interacting with the cellular machinery, displaces an 
active protein from its interaction with the cellular machinery or competes with the active 
protein, thereby reducing the effect of the active protein. For example, a dominant negative 
receptor which binds a iigand but does not transmit a signal in response to binding of the 
ligand can reduce the biological effect of expression of the ligand. Likewise, a dominant 
negative inactive SOC/CRAC calcium channel which interacts normally with the cell 
membrane but which does not mediate calcium transport can reduce calcium transport in a 
cell. Similarly, a dominant negative transcription factor which binds to a promoter site in the 
control region of a gene but does not increase gene transcription can reduce the effect of a 
normal transcription factor by occupying promoter binding sites without increzising 
transcription. 

The end result of the expression of a dominant negative polypeptide in a cell is a 
reduction in function of active proteins. One of ordinary skill in the art can assess the 
potential for a dominant negative variant of a protein, and using standard mutagenesis 
techniques to create one or more dominant negative variant polypeptides. See, e.g., U.S. 
Patent No. 5,580,723 and Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual, 
Second Edition, Cold Spring Harbor Laboratory Press. The skilled artisan then can test the 
population of mutagenized polypeptides for diminution in a selected and/or for retention of 
such an activity. Other similar methods for creating and testing dominant negative variants of 
a protein will be apparent to one of ordinary skill in the art. 

According to another aspect, the invention provides a method for isolating a 
SOC/CRAC molecule having SOC/CRAC calcium channel activity. The method involves 
contacting a binding molecule that is a SOC/CRAC nucleic acid or a SOC/CRAC binding 
polypeptide with a sample containing one or more SOC/CRAC molecules under conditions 
that allow such binding (see earlier discussion) to form a complex, detecting the presence of 
the complex, isolating the SOC/CRAC molecule from the complex, and determining whether 
the isolated SOC/CRAC molecule has SOC/CRAC calcium channel activity. Thus, the 
invention is useful for identifying and isolating full length complementary (cDNA) or 
genomic nucleic acids encoding SOC/CRAC polypeptides having SOC/CRAC calcium 
channel activity. Identification and isolation of such nucleic acids and polypeptides may be 
accomplished by hybridizing/binding, imder appropriate conditions well known in the art, 
libraries and/or restriction enzyme-digested human nucleic acids, with a labeled SOC/CRAC 
molecular probe. As used herein, a "labeF' includes molecules that are incorporated into, for 
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example, a SOC/CRAC molecule (nucleic acid or peptide), that can be directly or indirectly 
detected. A wide variety of detectable labels are well known in the art that can be used, and 
include labels that provide direct detection (e.g., radioactivity, luminescence, optical or 
electron density, etc), or indirect detection (e.g., epitope tag such as the FLAG epitope, 
enzyme tag such as horseseradish peroxidase, etc.). The label may be bound to a SOC/CRAC 
binding partner, or incorporated into the structure of the binding partner. 

A variety of methods may be used to detect the label, depending on the nature of the 
label and other assay components. For example, the label may be detected while boimd to the 
solid substrate or subsequent to separation from the solid substrate. Labels may be directly 
detected through optical or electron density, radioactive emissions, nonradioactive energy 
transfers, etc. or indirectly detected with antibody conjugates, strepavidin-biotin conjugates, 
etc. Methods for detecting the labels are well known in the art. Once a library clone or 
hybridizing fragment is identified in the hybridization/binding reaction, it can be further 
isolated by employing standard isolation/cloning techniques known to those of skill in the art. 
See, generally, Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual^ 2nd 
Edition, Cold Spring Harbor Laboratory Press. In addition, nucleic acid amplification 
techniques well knovm in the art, may also be used to locate splice variants of calcium 
channel (or calcium channel subunits) wdth SOC/CRAC calcium channel activity. Size and 
sequence determinations of the amplification products can reveal splice variants. 

The foregoing isolated nucleic acids and polypeptides may then be compared to the 
nucleic acids and polypeptides of the present invention in order to identify homogeneity or 
divergence of the sequences, and be further characterized functionally to determine whether 
they belong to a family of molecules with SOC/CRAC calcium channel activity (for 
methodology see under the Examples section). 

The isolation of the SOC/CRAC cDNA and/or partial sequences thereof also makes it 
possible for the artisan to diagnose a disorder characterized by an aberrant expression of 
SOC/CRAC. These methods involve determining expression of the SOC/CRAC gene, and/or 
SOC/CRAC polypeptides derived therefrom. In the former situation, such determinations can 
be carried out via any standard nucleic acid determination assay, including the polymerase 
chain reaction, or assaying with labeled hybridization probes as exemplified below. In the 
latter situation, such determination can be carried out via any standard immunological assay 
using, for example, antibodies which bind to the SOC/CRAC protein. 
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The invention also embraces isolated peptide binding agents which, for example, can 
be antibodies or fragments of antibodies ("binding polypeptides"), having the ability to 
selectively bind to SOC/CRAC polypeptides. Antibodies include polyclonal and monoclonal 
antibodies, prepared according to conventional methodology. In certain embodiments, the 
5 invention excludes binding agents (e.g., antibodies) that bind to the polypeptides encoded by 
the nucleic acids ofSEQ IDNOs: 10, 12, 13, 14, 15, 17, and 19. 

Significantly, as is well-known in the art, only a small portion of an antibody 
molecule, the paratope, is involved in the binding of the antibody to its epitope (see, in 
general, Clark, W.R. (1986) The Experimental Foundations of Modem Immunology Wiley & 

JO Sons, Inc., New York; Roitt, I. (1991) Essential Immunology , 7th Ed., Blackwell Scientific 
Publications, Oxford). The pFc' and Fc regions, for example, are effectors of the complement 
cascade but are not involved in antigen binding. An antibody from which the pFc' region has 
been enzymatically cleaved, or which has been produced without the pFc' region, designated 
an F(ab')2 fragment, retains both of the antigen binding sites of an intact antibody. Similarly, 

J5 an antibody from which the Fc region has been enzymatically cleaved, or which has been 
produced vsithout the Fc region, designated an Fab fragment, retains one of the antigen 
binding sites of an intact antibody molecule. Proceeding further. Fab fragments consist of a 
covalently bound antibody light chain and a portion of the antibody heavy chain denoted Fd. 
The Fd fragments are the major determinant of antibody specificity (a single Fd fragment may 

20 be associated with up to ten different light chains without altering antibody specificity) and Fd 
fragments retain epitope-binding ability in isolation. 

Within the antigen-binding portion of an antibody, as is well-known in the art, there 
are complementarity determining regions (CDRs), which directly interact with the epitope of 
the antigen, and framework regions (FRs), which maintain the tertiary structure of the 

25 paratope (see, in general, Clark, 1986: Roitt, 1991). In both the heavy chain Fd fragment and 
the light chain of IgG immunoglobulins, there are four framework regions (FRl through FR4) 
separated respectively by three complementarity determining regions (CDRl through CDR3). 
The CDRs, and in particular the CDR3 regions, and more particularly the heavy chain CDR3, 
are largely responsible for antibody specificity. 

30 It is now well-established in the art that the non-CDR regions of a manunalian 

antibody may be replaced with similar regions of conspecific or heterospecific antibodies 
while retaining the epitopic specificity of the original antibody. This is most clearly 
manifested in the development and use of "humanized" antibodies in which non-himian CDRs 
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are covalently joined to human FR and/or Fc/pFc' regions to produce a functional antibody. 
Thus, for example, PCT International Publication Number WO 92/04381 teaches the 
production and use of humanized murine RSV antibodies in which at least a portion of the 
murine FR regions have been replaced by FR regions of human origin. Such antibodies, 
including fragments of intact antibodies with antigen-binding ability, are often referred to as 
"chimeric" antibodies. 

Thus, as will be apparent to one of ordinary skill in the art, the present invention also 
provides for F(ab')2, Fab, Fv and Fd fragments; chimeric antibodies in which the Fc and/or 
FR and/or CDRl and/or CDR2 and/or light chain CDR3 regions have been replaced by 
homologous human or non-human sequences; chimeric F(ab')2 fragment antibodies in which 
the FR and/or CDRl and/or CDR2 and/or light chain CDR3 regions have been replaced by 
homologous human or non-human sequences; chimeric Fab fragment antibodies in which the 
FR and/or CDRl and/or CDR2 and/or light chain CDR3 regions have been replaced by 
homologous human or non-human sequences; and chimeric Fd fragment antibodies in which 
the FR and/or CDRl and/or CDR2 regions have been replaced by homologous human or non- 
human sequences. The present invention also includes so-called single chain antibodies. 

Thus, the invention involves binding polypeptides of numerous size and type that bind 
selectively to SOC/CRAC polypeptides, and complexes containing SOC/CRAC polj^eptides. 
These binding polypeptides also may be derived also from sotirces other than antibody 
technology. For example, such polypeptide binding agents can be provided by degenerate 
peptide libraries which can be readily prepared in solution, in immobilized form, as bacterial 
flagella peptide display libraries or as phage display libraries. Combinatorial libraries £dso 
can be synthesized of peptides containing one or more amino acids. Libraries further can be 
synthesized of peptides and non-peptide synthetic moieties. 

Phage display can be particularly effective in identifying binding peptides useful 
according to the invention. Briefly, one prepares a phage library (using e.g. ml3, fd, or 
lambda phage), displaying inserts from 4 to about 80 amino acid residues using conventional 
procedures. The inserts may represent, for example, a completely degenerate or biased array. 
One then can select phage-bearing inserts which bind to the SOC/CRAC polypeptide or a 
complex containing a SOC/CRAC polypeptide. This process can be repeated through several 
cycles of reselection of phage that bind to the SOC/CRAC polypeptide or complex. Repeated 
rounds lead to enrichment of phage bearing particular sequences. DN A sequence analysis can 
be conducted to identify the sequences of the expressed polypeptides. The minimal linear 
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portion of the sequence that binds to the SOC/CRAC polypeptide or complex can be 
determined. One can repeat the procedure using a biased library containing inserts containing 
part or all of the minimal linear portion plus one or more additional degenerate residues 
upstream or downstream thereof. Yeast two-hybrid screening methods also may be used to 
identify polypeptides that bind to the SOC/CRAC polypeptides. Thus, the SOC/CRAC 
polypeptides of the invention, or a fragment thereof, or complexes of SOC/CRAC can be used 
to screen peptide libraries, including phage display libraries, to identify and select peptide 
binding polypeptides that selectively bind to the SOC/CRAC polypeptides of the invention. 
Such molecules can be used, as described, for screening assays, for pxirification protocols, for 
interfering directly with the functioning of SOC/CRAC and for other purposes that will be 
apparent to those of ordinary skill in the art. 

A SOC/CRAC polypeptide, or a fragment thereof, also can be used to isolate naturally 
occurring, polypeptide binding partners which may associate with the SOC/CRAC 
polypeptide in the membrane of a cell. Isolation of binding partners may be performed 
according to well-known methods. For example, isolated SOC/CRAC polypeptides can be 
attached to a substrate, and then a solution suspected of containing an SOC/CRAC binding 
partner may be applied to the substrate. If the binding partner for SOC/CRAC polypeptides is 
present in the solution, then it will bind to the substrate-boimd SOC/CRAC polypeptide. The 
binding partner then may be isolated. Other proteins which are binding partners for 
SOC/CRAC, may be isolated by similar methods without undue experimentation. 

The invention also provides novel kits which could be used to measure the levels of 
the nucleic acids of the invention, expression products of the invention or anti-SOC/CRAC 
antibodies. In the case of nucleic acid detection, pairs of primers for amplifying SOC/CRAC 
nucleic acids can be included. The preferred kits would include controls such as known 
amounts of nucleic acid probes, SOC/CRAC epitopes (such as SOC/CRAC expression 
products) or anti-SOC/CRAC antibodies, as well as instructions or other printed material. In 
certain embodiments the printed material can characterize risk of developing a disorder that is 
characterized by aberrant SOC/CRAC polypeptide expression based upon the outcome of the 
assay. The reagents may be packaged in containers and/or coated on wells in predetermined 
amounts, and the kits may include standard materials such as labeled immunological reagents 
(such as labeled anti-IgG antibodies) and the like. One kit is a packaged polystyrene 
microtiter plate coated with a SOC/CRAC polypeptide and a container containing labeled 
anti-human IgG antibodies. A well of the plate is contacted with, for example, serum, washed 
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and then contacted with the anti-IgG antibody. The label is then detected. A kit embodying 
features of the present invention is comprised of the following major elements: packaging an 
agent of the invention, a control agent, and instructions. Packaging is a box-like structure for 
holding a vial (or number of vials) containing an agent of the invention, a vial (or number of 
vials) containing a control agent, and instructions. Individuals skilled in the art can readily 
modify packaging to suit individual needs. 

Another aspect of the invention is a method for determining the level of SOC/CRAC 
expression in a subject. As used herein, a subject is a human, non-himian primate, cow, 
horse, pig, sheep, goat, dog, cat or rodent. In all embodiments, human subjects are preferred. 
Expression is defined either as SOC/CRAC mRNA expression or SOC/CRAC polypeptide 
expression. Various methods can be used to measure expression. Preferred embodiments of 
the invention include PCR and Northem blotting for measuring mRNA expression, and 
monoclonal or polyclonal SOC/CRAC antisera as reagents to measure SOC/CRAC 
polypeptide expression. In certain embodiments, test samples such as biopsy samples, and 
biological fluids such as blood, are used as test samples. SOC/CRAC expression in a test 
sample of a subject is compared to SOC/CRAC expression in control sample to, e.g., assess 
the presence or absence or stage of a proliferative disorder (e.g., a lymphocyte proliferative 
disorder) in a subject. 

SOC/CRAC polypeptides preferably are produced recombinantly, although such 
polypeptides may be isolated firom biological extracts. Recombinantly produced SOC/CRAC 
polypeptides include chimeric proteins comprising a fusion of a SOC/CRAC protein with 
another polypeptide, e.g., a polypeptide capable of providing or enhancing protein-protein 
binding, sequence specific nucleic acid binding (such as GAL4), enhancing stability of the 
SOC/CRAC polypeptide imder assay conditions, or providing a detectable moiety, such as 
green fluorescent protein. A polypeptide fused to a SOC/CRAC polypeptide or fragment may 
also provide means of readily detecting the fusion protein, e.g., by immunological recognition 
or by fluorescent labeling. 

The invention is also useful in the generation of transgenic non-human animals. As 
used herein, ''transgenic non-human animals'" includes non-human animals having one or 
more exogenous nucleic acid molecules incorporated in germ line cells and/or somatic cells. 
Thus the transgenic animal include "knockout" animals having a homo^gous or 
heterozygous gene disruption by homologous recombination, animals having episomal or 
chromosomally incorporated expression vectors, etc. Knockout animals can be prepared by 
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homologous recombination using embryonic stem cells as is well known in the art. The 
recombination may be facilitated using, for example, the cre/lox system or other recombinase 
systems known to one of ordinary skill in the art. In certain embodiments, the recombinase 
system itself is expressed conditionally, for example, in certain tissues or cell types, at certain 
embryonic or post-embryonic developmental stages, inducibly by the addition of a compound 
which increases or decreases expression, and the like. In general, the conditional expression 
vectors used in such systems use a variety of promoters which confer the desired gene 
expression pattern (e.g., temporal or spatial). Conditional promoters also can be operably 
linked to SOC/CRAC nucleic acid molecules to increase expression of SOC/CRAC in a 
regulated or conditional manner. Tra/iy-acting negative regulators of SOC/CRAC calcium 
chaimel activity or expression also can be operably linked to a conditional promoter as 
described above. Such trans-acting regulators include antisense SOC/CRAC nucleic acids 
molecules, nucleic acid molecules which encode dominant negative SOC/CRAC molecules, 
ribozyme molecules specific for SOC/CRAC nucleic acids, and the like. The transgenic non- 
human animals are useful in experiments directed toward testing biochemical or physiological 
efTects of diagnostics or therapeutics for conditions characterized by increased or decreased 
SOC/CRAC expression. Other uses will be apparent to one of ordinary skill in the art. 

The invention further provides efficient methods of identifying agents or lead 
compoxmds for agents active at the level of a SOC/CRAC polypeptide (e.g., a SOC/CRAC 
polj^eptide) or SOC/CRAC fragment dependent cellular function. In particular, such 
functions include interaction with other polypeptides or fragments thereof, and selective 
binding to certain molecules (e.g., agonists and antagonists). Generally, the screening 
methods inyolve assaying for compounds which interfere with SOC/CRAC calcium charmel 
activity, although compounds which enhance SOC/CRAC calcium channel activity also can 
be assayed using the screening methods. Such methods are adaptable to automated, high 
throughput screening of compounds. The target therapeutic indications for pharmacological 
agents detected by the screening methods are limited only in that the target cellular function 
be subject to modulation by alteration of the formation of a complex comprising a 
SOC/CRAC polypeptide or fiagment thereof and one or more SOC/CRAC binding targets. 
Target indications include cellular processes modulated by SOC/CRAC such as Ca^"^ fluxing, 
and affected by SOC/CRAC ability to form complexes with other molecules and polypeptides 
as, for example, may be present in the cell membrane. 
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A wide variety of assays for phaxmacologica] ^ents are provided^ including, 
expression assays, labeled in vitro protein-protein binding assays, electrophoretic mobility 
shift assays, immunoassays, cell-based assays such as calcium transport assays, etc. For 
example, two-hybrid screens are used to rapidly examine the effect of transfected nucleic 
acids on the intracellular binding of SOC/CRAC or SOC/CRAC fragments to specific 
intracellular targets (e.g. a tyrosine kinase). The transfected nucleic acids can encode, for 
example, combinatorial peptide libraries or cDNA libraries. Convenient reagents for such 
assays, e.g., GAL4 fusion proteins, are known in the art. An exemplary cell-based assay 
involves transfecting a cell with a nucleic acid encoding a SOC/CRAC polypeptide fused to a 
GAL4 DNA binding domain and a nucleic acid encoding a reporter gene operably linked to a 
gene expression regulatory region, such as one or more GAL4 binding sites. Activation of 
reporter gene transcription occurs when the SOC/CRAC and reporter fusion polypeptides bind 
such as to enable transcription of the reporter gene. Agents which modulate a SOC/CRAC 
polypeptide mediated cell function are then detected through a change in the expression of 
reporter gene. Methods for determining changes in the expression of a reporter gene are 
known in the art. 

In an expression system, for example, a SOC/CRAC polypeptide is attached to a 
membrane, the membrane preferably separating two fluid environments and being otherwise 
not permeable to Ca^"^. Such separation is preferred so that a change in Ca^^ concentration on 
either side of the membrane is mediated only through the attached SOC/CRAC polypeptide. 
Preferably, a SOC/CRAC polypeptide is expressed in an intact cell and is present on the cell- 
membrane (as in physiologic conditions). The cell expressing the SOC/CRAC polypeptide is 
preferably a eukaryotic cell, and the SOC/CRAC polypeptide is preferably recombinantly 
expressed, although cells naturally expressing a SOC/CRAC polypeptide may also be used. 
Synthetic membranes, however, containing SOC/CRAC polypeptides may also be used. See, 
e.g., K. Kiselyov, et al.. Functional interaction between lnsP3 receptors and store-operated 
Htrp3 channels. Nature 396 . 478-82 (1998). 

The cell expressing the SOC/CRAC polypeptide is incubated under conditions which, 
in the absence of the candidate agent, permit calcium flux into the cell and allow detection of 
a reference calcium concentration. For example, depletion of intracellular calcium stores with 
thapsigargin or other agents (Putney, J.W. Jr., in Capacitative Calcium Entrv, R.G. Landes 
Co. and Chapman & Hall, 1997) would produce a given level of SOC/CRAC channel 
activation and a given reference calcium concentration. Detection of a decrease in the 
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foregoing activities (i.e., a decrease in the intracellular calcium concentration) relative to the 
reference calcium concentration indicates that the candidate agent is a lead compound for an 
agent to inhibit SOC/CRAC calcium channel activity. Preferred SOC/CRAC polypeptides 
include the polypeptides of claim 15. 

SOC/CRAC fragments used in the methods, when not produced by a transfected 
nucleic acid are added to an assay mixture as an isolated polypeptide. SOC/CRAC 
polypeptides preferably are produced recombinantly, although such polypeptides may be 
isolated from biological extracts or chemically synthesized. Recombinantly produced 
SOC/CRAC polypeptides include chimeric proteins comprising a fusion of a SOC/CRAC 
protein with another polypeptide, e.g., a polypeptide capable of providing or enhancing 
protein-protein binding, sequence specific nucleic acid binding (such as GAL4), enhancing 
stability of the SOC/CRAC polypeptide under assay conditions, or providing a detectable 
moiety, such as green fluorescent protein or Flag epitope. 

The assay mixture is comprised of a SOC/CRAC polypeptide binding target 
(candidate agent) capable of interacting with a SOC/CRAC polypeptide. While natural 
SOC/CRAC binding targets may be used, it is frequently preferred to use portions (e.g., 
peptides or nucleic acid fragments) or analogs (i.e., agents which mimic the SOC/CRAC 
binding properties of the natural binding target for purposes of the assay) of the SOC/CRAC 
binding target so long as the portion or analog provides binding affinity and avidity to the 
SOC/CRAC polypeptide (or fragment thereof) measurable in the assay. 

The assay mixture also comprises a candidate agent (binding target, e.g., 
agonist/antagonist). Typically, a plurality of assay mixtures are run in parallel with different 
agent concentrations to obtain a different response to the various concentrations- Typically, 
one of these concentrations serves as a negative control, i.e., at zero concentration of agent or 
at a concentration of agent below the limits of assay detection. Candidate agents encompass 
numerous chemical classes, although typically they are organic compounds. Preferably, the 
candidate agents are small organic compoimds, i.e., those having a molecular weight of more 
than 50 yet less than about 2500, preferably less than about 1000 and, more preferably, less 
than about 500. Candidate agents comprise functional chemical groups necessary for 
structural interactions with polypeptides and/or nucleic acids, and typically include at least an 
amine,, carbonyl, hydroxyl or carboxyl group, preferably at least two of the functional 
chemical groups and more preferably at least three of the functional chemical groups. The 
candidate agents can comprise cyclic carbon or heterocyclic structure and/or aromatic or 
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polyaromatic structures substituted with one or more of the above-identified functional 
groups. Candidate agents also can be biomolecules such as peptides, saccharides, fatty acids, 
sterols, isoprenoids, purines, pyriraidines, derivatives or structural analogs of the above, or 
combinations thereof and the like. Where the agent is a nucleic acid, the agent typically is a 
DNA or RNA molecule, although modified nucleic acids as defined herein are also 
contemplated. 

Candidate agents are obtained fi-om a wide variety of sources including libraries of 
synthetic or natural compounds. For example, mmierous means are available for random and 
directed synthesis of a wide variety of orgemic compounds and biomolecules, including 
expression of randomized oligonucleotides, synthetic organic combinatorial libraries, phage 
display libraries of random peptides, and the like. Alternatively, libraries of natural 
compounds in the form of bacterial, fungal, plant and animal extracts are available or readily 
produced. Additionally, natural and synthetically produced libraries and compounds can be 
readily modified through conventional chemical, physical, and biochemical means. Further, 
knovsm agents may be subjected to directed or random chemical modifications such as 
acylation, alkylation, esterification, amidification, etc. to produce structxiral analogs of the 
agents. Non-SOC/CRAC calcium channel agonists and antagonists, for example, include 
agents such as dihydropyridines (DHPs), phenylalkylamines, omega conotoxin 
(omega.-CgTx) and pyrazonoylguanidines. 

A variety of other reagents also can be included in the mixture. These include 
reagents such as salts, buffers, neutral proteins (e.g., albumin), detergents, etc. which may be 
used to facilitate optimal protein-protein, protein-nucleic acid, and/or protein/membrane 
component binding association. Such a reagent may also reduce non-specific or backgroimd 
interactions of the reaction components. Other reagents that improve the efficiency of the 
assay such as protease, inhibitors, nuclease inhibitors, antimicrobial agents, and the like may 
also be used. 

The mixture of the foregoing assay materials is incubated under conditions whereby, 
but for the presence of the candidate agent, the SOC/CRAC polypeptide specifically binds the 
cellular binding target, a portion thereof or analog thereof. The order of addition of 
components, incubation temperature, time of incubation, and other perimeters of the assay 
may be readily determined. Such experimentation merely involves optimization of the assay 
parameters, not the fundamental composition of the assay. Incubation temperatures typically 
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are between 4®C and 40°C. Incubation times preferably are minimized to facilitate rapid, high 
throughput screening, and typically are between 0.1 and 10 hours. 

After incubation, the presence or absence of specific binding between the SOC/CRAC 
polypeptide and one or more binding targets is detected by any convenient method available 
to the user. For cell free binding type assays, a separation step is often used to separate bound 
fi-om unbound components. The separation step may be accomplished in a variety of ways. 
Conveniently, at least one of the components is immobilized on a solid substrate, fi-om which 
the unbound components may be easily separated. The solid substrate can be made of a wide 
variety of materials and in a wide variety of shapes, e.g., microtiter plate, microbead, dipstick, 
resin particle, etc. The substrate preferably is chosen to maximum signal to noise ratios, 
primarily to minimize background binding, as well as for ease of separation and cost. 

Separation may be effected for example, by removing a bead or dipstick from a 
reservoir, emptying or diluting a reservoir such as a microtiter plate well, rinsing a bead, 
particle, chromotograpic column or filter with a wash solution or solvent. The separation step 
preferably includes multiple rinses or washes. For example, when the solid substrate is a 
microtiter plate, the wells may be washed several times with a washing solution, which 
typically includes those components of the incubation mixture that do not participate in 
specific bindings such as salts, buffer, detergent, non-specific protein, etc. Where the solid 
substrate is a magnetic bead, the beads may be washed one or more times with a v^hing 
solution and isolated using a magnet. 

Detection may be effected in any convenient way for cell-based assays such as two- or 
three-hybrid screens. The transcript resulting from a reporter gene transcription £issay of 
SOC/CRAC polypeptide interacting with a target molecule typically encodes a directly or 
indirectly detectable product, e.g., P-gaiactosidase activity, luciferase activity, and the like. 
For cell-free binding assays, one of the components usually comprises, or is coupled to, a 
detectable label. A wide variety of labels can be used, such as those that provide direct 
detection (e.g., radioactivity, luminescence, optical or electron density, etc.) or indirect 
detection (e.g., epitope tag such as the FLAG epitope, enzyme tag such as horseseradish 
peroxidase, etc.). The label may be bound to a SOC/CRAC binding partner, or incorporated 
into the structure of the binding partner. 

A variety of methods may be used to detect the label, depending on the nature of the 
label and other assay components. For example, the label may be detected while bound to the 
solid substrate or subsequent to separation from the solid substrate. Labels may be directly 
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detected through optical or electron density, radioactive emissions, nonradiative energy 
transfers, etc. or indirectly detected with antibody conjugates, strepavidin-biotin conjugates, 
etc. Methods for detecting the labels are well known in the art. 

Of particular importance in any of the foregoing assays £ind binding studies is the use 
of a specific sequence motif identified in the SOC-2/CRAC-1 polypeptide sequence as a 
kinase catalytic domain. According to the invention, amino acids 999-1180 of the SOC- 
2/CRAC-l polypeptide (SEQ ID NO:24) (or a fragment thereof), show a localized homology 
with the catalytic domains of eukaryotic elongation factor-2 kinase (eEF-2 kinase, GenBank 
Acc. no. U93850) and Dictyostelium myocin heavy chain kinase A (MHCK A, GenBank Acc. 
no. U16856), as disclosed in Ryazanov AG, et al., Proc Natl Acad Sci USA, 1997, 
94(10):4884-4889. Therefore, according to the invention, a method for identifying agents 
useful in the modulation of SOC/CRAC polypeptide kinase activity is provided. The method 
involves contacting a SOC/CRAC polypeptide with kinase activity, that includes, for 
example, ammo acids 999- 11 80 of the SOC-2/CRAC-1 polypeptide (SEQ ID NO:24) vwth a 
candidate agent suspected of modulating SOC/CRAC kinase activity, under conditions 
sufficient to allow the candidate agent to interact with the SOC/CRAC polypeptide and 
modulate its kinase activity; detecting a kinase activity associated with the SOC/CRAC 
polypeptide in the presence of the candidate agent; and comparing the kinase activity in the 
previous step with a control kinase activity of a SOC/CRAC polypeptide in the absence of the 
candidate agent to determine whether the candidate agent modulates (increases or decreases) 
SOC/CRAC kinase activity. Other controls for kinase activity can also be performed at the 
same time, for example, by utilizing eEF-2 kinase and/or Dictyostelium MHC Kinase A, in a 
similar manner to the SOC/CRAC member. Methods for performing such kinase activity 
assays are well known in the art. 

The invention thus provides SOC/CRAC-specific binding agents, methods of 
identifying and making such agents, and their use in diagnosis, therapy and pharmaceutical 
development. For example, SOC/CRAC-specific agents are useful in a variety of diagnostic 
and therapeutic applications, especially where disease or disease prognosis is associated with 
altered SOC/CRAC and, SOC/CRAC calcium channel fluxing characteristics. Novel 
SOC/CRAC-specific binding agents include SOC/CRAC-specific antibodies and other natural 
intracellular and extracellular binding agents identified with assays such as two hybrid 
screens, and non-natural intracellular and extracellular binding agents identified in screens of 
chemical libraries and the like. 
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In general, the specificity of SOC/CRAC binding to a specific molecule is determined 
by binding equilibrium constants. Targets which are capable of selectively binding a 
SOC/CRAC polypeptide preferably have binding equilibrium constants of at least about 10^ 
NT^ more preferably at least about 10* M'\ and most preferably at least about 10^ M"'. The 
wide variety of cell based and cell free assays may be used to demonstrate SOC/CRAC- 
specific binding. Cell based assays include one, two and three hybrid screens, assays in 
which SOC/CRAC-mediated transcription is inhibited or increased, etc. Cell free assays 
include SOC/CRAC-protein binding assays, inununoassays, etc. Other assays useful for 
screening agents which bind SOC/CRAC polypeptides include fluorescence resonance energy 
transfer (FRET), and electrophoretic mobility shift analysis (EMSA). 

Various techniques may be employed for introducing nucleic acids of the invention 
into cells, depending on whether the nucleic acids are introduced in vitro or in vivo in a host. 
Such techniques include transfection of nucleic acid-CaP04 precipitates, transfection of 
nucleic acids associated with DEAE, transfection with a retrovirus including the nucleic acid 
of interest, liposome mediated transfection, and the like. For certain uses, it is preferred to 
target the nucleic acid to particular cells. In such instances, a vehicle used for delivering a 
nucleic acid of the invention into a cell (e.g., a retrovirus, or other virus; a liposome) can have 
a targeting molecule attached thereto. For example, a molecule such as an antibody specific 
for a surface membrane protein on the target cell or a ligand for a receptor on the target cell 
can be bound to or incorporated within the nucleic acid delivery vehicle. • For example, where 
liposomes are employed to deliver the nucleic acids of the invention, proteins which bind to a 
surface membrane protein associated with endocytosis may be incorporated into the liposome 
formulation for targeting and/or to facilitate uptake. Such proteins include capsid proteins or 
fragments thereof tropic for a particular cell type, antibodies for proteins which undergo 
internalization in cycling, proteins that target intracellular localization and enhance 
intracellular half life, and the like. Polymeric delivery systems also have been used 
successfully to deliver nucleic acids into cells, as is known by those skilled in the art. Such 
systems even permit oral delivery of nucleic acids. 

Other delivery systems can include time-release, delayed release or sustained release 
delivery systems. Such systems can avoid repeated administrations of the anti-inflammatory 
agent, increasing convenience to the subject and the physician. Many types of release 
delivery systems are aveiilable and known to those of ordinary skill in the art. They include 
polymer base systems such as poly(lactide-glycolide), copolyoxalates, polycaprolactones, 
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polyesteramides, polyorthoesters, polyhydroxybut>Tic acid, and polyanhydrides. 
Microcapsules of the foregoing polymers containing drugs are described in, for example, U.S. 
Patent 5,075,109. Delivery systems also include non-pol3aner systems that are: lipids 
including sterols such as cholesterol, cholesterol esters and fatty acids or neutral fats such as 
mono- di- and tri-glycerides; hydrogel release systems; sylastic systems; peptide based 
systems; wax coatings; compressed tablets using conventional binders and excipients; 
partially fused implants; and the like. Specific examples include, but are not limited to: (a) 
erosional systems in which an agent of the invention is contained in a form within a matrix 
such as those described in U.S. Patent Nos. 4,452,775, 4,675,189, and 5,736,152, and (b) 
difiusional systems in which an active component permeates at a controlled rate from a 
polymer such as described in U.S. Patent Nos. 3,854,480, 5,133,974 and 5,407,686. In 
addition, pump-based hardware delivery systems can be used, some of which are adapted for 
implantation. 

Use of a long-term sustained release implant may be particularly suitable for treatment 
of chronic conditions. Long-term release, are used herein, means that the implant is 
constructed and arranged to delivery therapeutic levels of the active ingredient for at least 30 
days, and preferably 60 days. Long-tenn sustained release implants are well-known to those 
of ordinary skill in the art and include some of the release systems described above. 

The invention also contemplates gene therapy. The procedure for performing ex vivo 
gene therapy is outlined in U.S. Patent 5,399,346 and in exhibits submitted in the file history 
of that patent, all of which are publicly available documents. In general, it involves 
introduction in vitro of a functional copy of a gene into a cell(s) of a subject which contains a 
defective copy of the gene, and returning the genetically engineered cell{s) to the subject. The 
functional copy of the gene is under operable control of regulatory elements which permit 
expression of the gene in the genetically engineered cell(s). Numerous transfection and 
transduction techniques as well as appropriate expression vectors are well known to those of 
ordinary skill in the art, some of which are described in PCT application WO95/00654. In 
vivo gene therapy using vectors such as adenovirus, retroviruses, herpes virus, and targeted 
liposomes also is contemplated according to the invention. See, e.g., U.S. Patent Nos. 
5,670,488, entitled "Adenovirus Vector for Gene Ther^y", issued to Gregory et al., and 
5,672,344, entitled "Viral-Mediated Gene Transfer System", issued to Kelley et al. 
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The invention will be more fully understood by reference to the following examples. 
These examples, however, are merely intended to illustrate the embodiments of the invention 
and are not to be construed to limit the scope of the invention. 

Examples 

As an initial approach to identifying SOC/CRAC channels, we considered publicly 
available data and hypothesized that the following characteristics are likely to be exhibited by 
SOC/CRAC calcium channels: i) SOC/CRAC calcium channels would be integral membrane 
proteins related (probably distantly) to one of the known calcium channel families (e.g. 
voltage gated, ligand gated, Trp), and therefore should have a pore region formed by a 
tetramer of 6-7 transmembrane (TM) regions; ii) high calcium selectivity was likely to come 
at the price of complexity, and therefore these were likely to be large proteins; iii) the high 
calcium selectivity of this type of channel was likely to be useful and, therefore, highly 
conserved; and iv) these channels should be expressed in one or more types of lymphocytes, 
since ICRAC is best defined in those cell types. Since the fiill genome of the nematode C 
elegans is nearing completion, and IP3 -dependent calcium signals have recently been shown 
to be required for one or more aspects of C. elegans development, we took the set of proteins 
encoded by this genome (at the time this search was initiated WORMPEP14 was the available 
predicted protein set) and began searching for proteins which fit the criteria above. This 
search began by proceeding in alphabetical order through WORMPEP14 and arbitrarily 
excluding all proteins below approximately 1000 amino acids in size, followed by focusing on 
remaining proteins with clear TM spanning regions similar to those of other calcium channels. 
We stopped this screen on encountering a protein designated C05C12.3, a predicted protein 
of 1816 amino acids (SEQ ID NO: 13). C05C12.3 was noteable because its central pore 
region had some sequence similarity to but was clearly distinct from members of the Trp 
family of calcium channels, and the hydrophobicity plot of this region showed a 
characteristically wide spacing between the fifth and sixth TM regions for the amino acid 
residues which are thought to line the channel pore region and mediate the calcium selectivity 
of the channels. In addition, it lacked any ankyrin repeats in the region amino-terminal to its 
pore region, further distinguishing it from other Trp family proteins. 

We then used C05C12.3 for BLAST alignment screening of the rest of the C elegans 
genome and also manunalian databases for homologous proteins, revealing two other C 
elegans homologues (SEQ ID NO: 14 and SEQ ID NO: 15), and also a recently cloned 
mammalian protein named melastatin-1 (MLSN-l/SOC-1, SEQ ID NOs:9 and 10, and 
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GenBank Acc. No. AF071787). Using these sequences, we subsequently performed an 

exhaustive screening of publicly accessible EST databases in search of lymphocyte 

homologues, but were unsuccessful in detecting any homologous transcripts in any 

lymphocyte lines. Since MLSN-1 (SEQ ID NOs:9 and 10) was expressed exclusively in 

melanocytes and retina by Northern blot hybridization and by EST database searching, there 

was no evidence that this type of channel was expressed in the type of cell in which ICRAC- 

like currents were best defined. Subsequent BLAST searches picked up mouse EST 

sequence AI098310 (SEQ ID NO:22) from a monocyte cell line. The I.M.A.G.E. consortiimi 

clone containing the above-identified EST was then purchased firom ATCC (clone ID. 

1312756, Manassas, VA) and was further characterized. Using other portions of this 

sequence in EST searches, we subsequendy picked up similzir sequences in human B-cells 

(SEQ ID NOs:20 and 21), and other cell types as well (SEQ ID NOs: 1 1, 12, 16, 17, 18, and 

19). Most of these sequences were subsequently identified to be part of the 3*-UTR or of the 

carboxy terminal region of the proteins, which are not readily identifiable as Trp channels, 

providing an explanation for the art's inability to detect any type of Trp related transcripts in 

lymphocytes. Partial sequences fi*om the 5' and/or 3' ends of the above identified clones were 

then used to screen leukocyte and kidney cDNA libraries to extend the original sequences 

more toward the 5' and/or 3' ends. 

In view of the foregoing, it was concluded that channels of this type were expressed in 

many types of lymphocytes, and therefore were members of a new family of SOC/CRAC 

calcium chaimels. 

Experimental Procedures 
Screening of the cDNA libraries 

Leukocyte and kidney cDNA libraries fi-om Life Technologies (Gaithersburg, MD) 

were screened using the Gene Trapper II methodology (Life Technologies) according to 

manufacturer's recommendation, using the inserts of LM.A.G.E. clone ID nos. 1312756 and 

1076485 from ATCC (Manassas, VA), imder stringent hybridization conditions. Using 

standard methodology ( Molecular Cloning: A Laboratory Manual, J. Sambrook, et al., eds.. 

Second Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1989, 

or Current Protocols in Molecular Biology, P.M. Ausubel, et al., eds., John Wiley & Sons, 

Inc., New York), individual cDNA clones were subjected to 3-4 rounds of amplification and 

purification imder the same hybridization conditions. 

After excision from the vector and subcloning of inserts into the plasmid forms, 

several clones were sequenced by the Beth Israel Deaconess Medical Center's Automated 
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Sequencing Facility. Molecular biological techniques such as restriction enzyme treatment, 
subcloning, DNA extraction, bacterial culture and purification of DNA fragments were 
performed according to methods well known in the art. Computer analyses of protein and 
DNA sequences was done using "Assemblylign" (Oxford Molecular, Cambell, CA). Multiple 
alignments of the SOC/CRAC family members were produced using the CLUSTAL facility 
of the MacVector program. Restriction endonucleases, expression vectors, and modifying 
enzymes were purchased from commercial sources (Gibco-BRL). Sequencing vectors for 
DNA were purchased from Stratagene (La JoUa, CA). 

Once the first members of what appeared to be a novel family of czdcium channel 
receptors were identified and characterized, additional BLAST alignments were performed 
with the newly characterized nucleic acid sequences. An initial match was with genomic 
DNA fragment NH0332L1 1 (Genbank Acc. No. AC005538). Using this genomic sequence, 
promers were designed and a number of cDNA libraries was surveyed by PGR. A prostate 
specific message was identified and characterized, leading to the isolation and 
characterization of SOC-4/CRAC-3 (SEQ ID NOs: 31 and 32). 
Functional Assays 

Transient Expression of SOC/CRAC 

In our initial transient expression experiments, we expressed or expect to express a 
SOC/CRAC molecule transiently in RBL-2H3 mast cells, Jurkat T cells, and A20 
B-lymphocytes using both electroporation and vaccinia virus-driven expression, and 
measured the calcixmi influx produced by depletion of intracellular calcium stores with 
thapsigargin. Each of the foregoing techniques is well known to those of ordineiry skill in the 
art and can be performed using various methods (see, e.g.. Current Methods in Molecular 
Biology, eds. Ausubal, F.M., et al. 1987, Green Publishers and Wiley Interscience, N.Y., 
N. Y.). Exemplary methods are described herein. 

Depletion of intracellular calcium stores is accomplished by treating the cells with 1 
micromolar thapsigargin; edtemative agents which function to deplete intracellular stores are 
described in by Putney, J.W. Jr., in Capacitative Calcium Entry. R.G. Landes Co. and 
Chapman & Hall, 1997 and include, for example, ionomycin, cyclopiazonic acid, and DBHQ. 

Calcium influx is determined by measuring cjrtoplasmic calcium as indicated using the 
fura-2 fluorescent calcium indicator (see, e.g., G. Grynkiewicz, M. Poenie, R. Y. Tsien, A 
new generation of Ca2+ indicators with greatly improved fluorescence properties, J. Biol 
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Chem 260, 3440-50 (1985), and M. Poenie, R. Tsien, Fura-2: a powerful new tool for 
measuring and imaging [Ca2+]i in single cells. Prog Clin Biol Res 210 > 53-6 (1986)). 

Patch Clamp Analysis and Determinins^ Selectivity of SOC/CRAC 

Patch clamp analysis of cells injected with SOC/CRAC cRNA is performed by using 
the general patch technique as described in Neher, E-, "Ion channels for communication 
between and within cells". Science, 1992 : 256:498-502. Specific techniques for applying the 
patch clamp analysis to RBL cells are described in Hoth, M., and Penner, R., "Depletion of 
intracellular calcium stores activates a calcium current in mast ceils". Nature. 1992 : 
355:3535-355. Additional protocols for applying the patch clamp technique to other cell 
types are described in Putney, J.W. Jr., in Capacitative Calcium Entry, R.G. Landes Co. and 
Chapman & Hall, 1997 

An exemplary protocol for patch clamp analysis of SOC/CRAC molecule expressed in 
RBL-2H3 mast cells using a recombinant vaccinia virus is as follows. The currents elicited 
by store depletion are determined using the whole cell configuration (Neher, E., Science^ 
i992 : 256:498-502). Currents in SOC/CRAC expressing cells are compared to currents in 
control cells expressing an irrelevant protein or a classic Trp family calcium channel known 
as VRl (M. J. Caterina, et al.. The c£^saicin receptor: a heat-activated ion channel in the pain 
pathway [see conmients]. Nature 389, 816-24 (1997)) in order to assess the contribution of 
SOC/CRAC expression. In addition, the magnitude of whole cell currents in the presence of 
extracellular calcium (10 mM), barium (10 mM), or magnesium (10 mM) are compared to 
determine the relative permeability of the channels to each of these ions (Hoth, M., and 
Penner, R., Nature. 1992 : 355:3535-355) and, thereby, determine the ionic selectivity. 

Pharmacologic Behavior of SOC/CRAC 

For analysis of the pharmacologic behavior of a SOC/CRAC molecule, a SOC/CRAC 
molecule is expressed in RBL-2H3 mast cells using a recombinant vaccinia virus, £ind the 
degree of calcium influx elicited by store depletion is monitored using a bulk 
spectrofluorimeter or a fluorescence microscope and the calcium sensitive dye fura-2 (G. 
Grynkiewics:, M. Poenie, R. Y. Tsien, A new generation of Ca2'*" indicators with greatly 
improved fluorescence properties, J Biol Chem 260 , 3440-50 (1985) and M. Poenie, R. 
Tsien, Fura-2: a powerful new tool for measuring and imaging [Ca2+]i in single cells. Prog 
Clin Biol Res 210> 53-6 (1986)). The level of cytoplasmic calcium in SOC/CRAC 
expressing cells is compared to the level achieved in control cells expressing an irrelevant 
protein or a classic Tip. family calcium channels known as VRl (M. J. Caterina, et al.. The 
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capsaicin receptor: a heat-activated ion channel in the pain pathway [see comments]. Nature 
389, 816-24 (1997)), These cells then are pre-incubated with the desired pharmacologic 
reagent, and again the response to store depletion is monitored. Comparison of the effect of 
depleting stores in SOC/CRAC expressing cells relative to controls in the presence or absence 
of the pharmacologic reagent is used to assess the ability of that reagent to modulate 
SOC/CRAC activity. Sphingosine is an exemplary molecule that can be used as 
pharmacologic reagents for pharmacologic characterization of SOC/CRAC calcium charmels. 
See, e.g., Mathes, C, et al., Calcium release activated calcium current as a direct target for 
sphingosine, J Biol Chem 273f39) :25Q20-25030 (1998). Other non-specific calcium channel 
inhibitors that can be used for this purpose include SKR96365 (Calbiochem) and Lanthanum. 
Bulk Calcium Assays 

Bulk calcium assays can be performed in a PTI Deltascan bulk spectrofluorometer using fura- 
2 as described in Scharenberg AM, et aL, EMBO J, 1995, 14(14):3385-94. 
Gene Targeting 

The method (and reagents) described by Buerstedde JM et al, {Cell, 1991, Oct 4;67(1):179- 
88), was used to generate "knockouts" in cells. Briefly, part of the chicken SOC-2/CRAC-1 
genomic sequence coding for the transmembrane region was cloned utilizing the human 
sequence as the probe in a chicken library screen. Chicken SOC-2/CRAC-1 clones were 
isolated and characterized using standard methodology. The putative exon and domain 
arrangement of the chicken SOC-2/CRAC-1, is depicted in Figure 1. The exons coding for 
TM5 (pore region) and TM6, were replaced with promoter/antibiotic cassettes (see Figurel). 
These targeting vectors were then used to target (and replace) the endogenous gene in DT-40 
cells (chicken B lymphocyte cells). 

Results 

Example 1 : Transient Expression of SOC/CRAC 

In the above-identified cell lines and using both of the foregoing expression 
techniques, SOC/CRAC expression enhances thapsigargin-dependent influx. In addition, 
SOC/CRAC expression also enhances the amount of intracellular calcium stores. That this 
effect is likely due to SOC/CRAC acting as a plasma membrane calcium channel can be 
confirmed by producing an in-frame carboxy-terminal translational fusion with green 
fluorescent protein followed by confocal microscopy, revealing that SOC/CRAC is expressed 
predominantly as a plasma membrane calcium channel. 
Example 2 : Patch Clamp Analysis 
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The biophysical characteristics of SOC/CRAC enhanced currents when expressed in 
Xenopus oocytes are determined. SOC/CRAC cRNA injection is able to enhance 
thapsigargin-dependent whole cell currents. In addition. SOC/CRAC does not alter the 
reversal potential of these currents and the determination of the Pca/PNa ratio shows that 
SOC/CRAC channels are highly calcium selective. 
Example 3 : Pharmacologic Behavior of SOC/CRAC 

The pharmacologic behavior of SOC/CRAC is evaluated as described above. 
SOC/CRAC-enhanced influx is inhibited by sphingosine in a manner that is substantially the 
same as that of endogenous thapsigargin-dependent calcium influx. 
Example 4 : Gene tarsetins 

Transfection of DT-40 cells with the foregoing targeting vectors, selection for 
antibiotic resistance, and screening, is collectivelly refered to, herein, as a rotmd of targeting. 
For the first round of targeting SOC-2/CRAC-1, 18/24 clones with homologous 
recombination of the targeting construct into one of the endogenous SOC-2/CRAC-1 alleles 
were obtained. On the second round of targeting (in order to target the second allele and 
therefore generate a homozygous SOC-2/CRAC-1 mutant cell), 0/48 clones were obtained. 
These results indicate that a "null" SOC-2/CRAC-1 mutation is detrimental to DT-40 cells, 
and that SOC-2/CRAC-1 is required for cell viability. 

Table I. Nucleotide Sequences with homologies to SOC/CRAC nucleic acids 

Sequences with SEQ ID NOs and GenBank accession numbers: 
SEQ IDNO:9, AB001535, AI226731, H18835, AA419592, AA261842, AA419407, 
AA592910, D86107, A1098310, AF071787, Z77132, Z831 17, Z68333, AA708532, 
AA551759, AA932133, R47363, N31660, AC005538, AA654650, AA370110, AA313170, 
AA493512, AI670079, AI671853. 

Table 11. Amino Acid Sequences with homologies to SOC/CRAC polypeptides 

Sequences with SEQ ID NOs and GenBank accession numbers: 

SEQ ID NO:10, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:17, SEQ ID 
NO:19, AB001535, AA592910, D86107, AF071787, Z77132, Z83il7, Z68333, 
AA708532, AA551759, AA932133, R47363, N31660, NP003298, CAB00861, NP00241 1, 
CAA92726, CAB05572. 

All references, patents, and patent documents disclosed herein are incorporated by 
reference herein in their entirety. 

What is claimed is presented below and is followed by a Sequence Listing. We claim: 
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Claims 

1 . An isolated nucleic acid molecule, comprising: 

(a) nucleic acid molecules which hybridize under stringent conditions to a nucleic 
acid molecule selected from the group consisting of SEQ ID NO:l, SEQ ID NO:3, SEQ ID 
NO:5, SEQ ID NO:7, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, and 
SEQ ID NO:3 1 , and which code for a SOC/CRAC polypeptide; 

(b) deletions, additions and substitutions of (a) which code for a respective 
SOC/CRAC polypeptide; 

(c) nucleic acid molecules that differ from the nucleic acid molecules of (a) or (b) in 
codon sequence due to the degeneracy of the genetic code, and 

(d) complements of (a), (b) or (c). 

2. The isolated nucleic acid molecule of claim 1, wherein the isolated nucleic acid 
molecule comprises SEQ ID NO: 1 . 

3. The isolated nucleic acid molecule of claim 1, wherein the isolated nucleic acid 
molecule comprises SEQ ID NO:27. 

4. The isolated nucleic acid molecule of claim 1, wherein the isolated nucleic acid 
molecule comprises SEQ ID NO:29. 

5. The isolated nucleic acid molecule of claim 1, wherein the isolated nucleic acid 
molecule comprises SEQ ID NO:31. 

6. An isolated nucleic acid molecule selected from the group consisting of 

(a) a unique fragment of a nucleic acid molecule selected from the group 
consisUng of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:23, 
SEQ ID NO:25, SEQ ID NO:29, and SEQ ID NO:31, 

(b) complements of (a), 

provided that the unique fragment includes a sequence of contiguous nucleotides which is not 
identical to any sequence selected from a sequence group consisting of 

(1) sequences having the SEQ. ID NOS. or GenBank accession nmnbers of Table I, 

(2) complements of (1), and 

(3) fragments of (1) and (2). 
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7. The isolated nucleic acid molecule of claim 6, wherein the sequence of contiguous 
nucleotides is selected from the group consisting of: 

(1) at least two contiguous nucleotides nonidentical to the sequence group, 

(2) at least three contiguous nucleotides nonidentical to the sequence group, 

(3) at least four contiguous nucleotides nonidentical to the sequence group, 

(4) at least five contiguous nucleotides nonidentical to the sequence group, 

(5) at least six contiguous nucleotides nonidentical to the sequence group, 

(6) at least seven contiguous nucleotides nonidentical to the sequence group. 

8. The isolated nucleic acid molecule of claim 6, wherein the unique fragment has a size 
selected from the group consisting of at least: 8 nucleotides, 10 nucleotides, 12 nucleotides, 
14 nucleotides, 16 nucleotides, 18 nucleotides, 20, nucleotides, 22 nucleotides, 24 
nucleotides, 26 nucleotides, 28 nucleotides, 30 nucleotides, 50 nucleotides, 75 nucleotides, 
100 nucleotides, and 200 nucleotides. 

9. The isolated nucleic acid molecule of claim 6, wherein the molecule encodes a 
polypeptide which is immunogenic. 

10. An expression vector comprising the isolated nucleic acid molecule of claims 1, 2, 3, 
4, 5, 6, 7, 8, or 9 operably linked to a promoter. 

11. A host cell transformed or transfected with the expression vector of claim 1 0. 

12. An isolated polypeptide encoded by the isolated nucleic acid molecule according to 
anyone of claims 1 or 6, wherein the polypeptide comprises a SOC/CRAC polypeptide or a 
unique fragment thereof. 

13. The isolated polypeptide of claim 12, wherein the isolated polypeptide is encoded by 
the isolated nucleic acid molecule of claim 2, 3, 4, or 5. 

14. The isolated polypeptide of claim 13, wherein the isolated polypeptide comprises a 
polypeptide having the sequence of amino acids selected from the group consisting of SEQ ID 
NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:24, SEQ ID NO:26, SEQ ID 
NO:28, SEQ ID NO:30, and SEQ ID NO:32. 
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1 5. An isolated polypeptide encoded by the isolated nucleic acid molecule of claim 1, 2, 3, 
4, or 5, wherein the polypeptide, or unique fragment thereof is immunogenic. 

16. An isolated binding polypeptide which binds selectively to a polypeptide encoded by 
the isolated nucleic acid molecule of claim 1, 2, 3, 4, or 5. 

17. The isolated binding polypeptide of claim 16, wherein the isolated binding 
polypeptide binds to a polypeptide having the sequence of amino acids selected from the 
group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID 
NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, and SEQ ID NO:32. 

18. The isolated binding polypeptide of claim 17, wherein the isolated binding 
polypeptide is an antibody or an antibody fragment selected from the group consisting of a 
Fab fragment, a F(ab)2 fragment or a fragment including a CDR3 region selective for the 
polypeptide. 

19. An isolated polypeptide, comprising a unique fragment of the polypeptide of claim 12 
of sufficient length to represent a sequence unique within the human genome, provided that 
the fragment excludes a sequence of contiguous amino acids identified in Table II. 

20. A method for isolating a SOC/CRAC molecule having SOC/CRAC calcium channel 
activity, comprising: 

a) contacting a binding molecule that is a SOC/CRAC nucleic acid or a SOC/CRAC 
binding polypeptide with a sample containing one or more SOC/CRAC molecules, under 
conditions sufficient to form a complex of the SOC/CRAC nucleic acid or the SOC/CRAC 
binding polypeptide and the SOC/CRAC molecule; 

b) detecting the presence of the complex; 

c) isolating the SOC/CRAC molecule from the complex; and 

d) determining whether the isolated SOC/CRAC molecule has SOC/CRAC calcium 
channel activity. 

21 . The method of claim 20, wherein the binding molecule is a SOC/CRAC nucleic acid. 

22. The method of claim 20, wherein the binding molecule is a SOC/CRAC binding 
polypeptide. 
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23. The method of claim 21, wherein the SOC/CRAC nucleic acid comprises at least 14 
nucleotides from any contiguous portion of a sequence of nucleotides selected from the group 
consisting of SEQ ID NO:l, SEQ ID NO:3. SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:23, 
SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, and SEQ ID NO:3 1 . 

24. A method for identifying agents useful in the modulation of SOC/CRAC calciimi 
channel activity, comprising: 

a) contacting a SOC/CRAC polypeptide with a candidate agent suspected of 
modulating SOC/CRAC calcium channel activity, under conditions sufficient to allow the 
SOC/CRAC polypeptide to interact selectively with the candidate agent; 

b) detecting a Ca^"^ concentration associated with SOC/CRAC calcium channel 
activity of the SOC/CRAC polypeptide in the presence of the candidate agent; and 

c) comparing the Ca^"*" concentration of step (b) with a control Ca^"*" concentration of a 
SOC/CRAC polypeptide in the absence of the candidate agent to determine whether the 
candidate agent modulates SOC/CRAC calcium channel activity. 

25. A method for determining the level of SOC/CRAC expression in a subject, 
comprising: 

a) measuring the expression of SOC/CRAC in a test sample obtained from the 
subject, and 

b) comparing the measured expression of SOC/CRAC in the test sample to the 
expression of the SOC/CRAC polypeptide in a control to determine the level of SOC/CRAC 
expression in the subject 

26. The method of claim 25, wherein the expression of SOC/CRAC in (b) is SOC/CRAC 
mRNA expression. 

27. The method of claim 25, wherein the expression of SOC/CRAC in (b) is SOC/CRAC 
polypeptide expression. 

28. The method of claim 25, wherein the test sample is tissue. 



29. The method of claim 25, wherein the test sample is a biological fluid. 
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30. The method of claim 26, wherein SOC/CRAC mRNA expression is measured using 
the Polymerase Chain Reaction (PGR). 

31. The method of claim 26, wherein SOC/CRAC mRNA expression is measured using a 
method selected from the group consisting of northern blotting, monoclonal antisera to 
SOC/CRAC and polyclonal antisera to SOC/CRAC. 

32. A kit, comprising a package containing: 

an agent that selectively binds to the isolated nucleic acid of claim 1 or an 
expression product thereof, and 

a control for comparing to a measured value of binding of said agent to said 
isolated nucleic acid of claim 1 or expression product thereof. 

33. The kit of claim 32, wherein the control comprises an epitope of the expression 
product of the nucleic acid of claim 1. 

34. A pharmaceutical composition comprising: 

a pharmaceutically effective amount of an agent comprising of an isolated 
nucleic acid molecule of claim 1 or an expression product thereof, and 
a pharmaceutically acceptable carrier. 

35. The pharmaceutical composition of claim 34, wherein the agent is an expression 
product of the isolated nucleic acid molecule of claim 1. 

36. A method for identifying agents useful in the modulation of a SOC/CRAC polypeptide 
kinase activity, comprising: 

a) contacting a SOC/CRAC polypeptide with kinase activity wdth a candidate agent 
suspected of modulating SOC/CRAC kinase activity, imder conditions sufficient to allow the 
candidate agent to interact with the SOC/CRAC polypeptide and modulate its kinase activity; 

b) detecting a kinase activity associated with the SOC/CRAC polypeptide in the 
presence of the candidate agent; and 

c) comparing the kinase activity of step (b) with a control kinase activity of a 
SOC/CRAC polypeptide in the absence of . the candidate agent to determine whether the 
candidate agent modulates SOC/CRAC kinase activity. 

37. The method of claim 36, wherein the SOC/CRAC polypeptide comprises 
amino acids 999-1 180 of the sequence represented as SEQ ID NO:24, or a fragment thereof 
that retains the kinase activity. 
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<110> Beth Israel Deaconess Medical Center, Inc. 
Scharenberg, Andrew 

<120> CHARACTERIZATION OF A CALCIUM CHANNEL FAMILY 

<130> B0662/7026WO/ERP/KA 

<150> U.S. 60/114,220 
<151> 1998-12-30 

<150> U.S. 60/120,018 
<151> 1999-01-29 

<150> U.S. 60/140,415 
<151> 1999-06-22 

<160> 32 

<170> FastSEQ for Windows Version 3.0 

<210> 1 

<211> 1212 

<212> DNA 

<213> Homo Sapiens 



<400> 1 

gcacgaggca aattttttgt tagtacacca tctcagccaa gttgcaaaag ccacttggaa 60 

actggaacca aagatcaaga aactgtttgc tctaaagcta cagaaggaga taatacagaa 120 

tttggagcat ttgtaggaca cagagatagc atggatttac agaggtttaa agaaacatca 180 

aacaagataa aaatactatc caataacaat acttctgaaa acactttgaa acgagtgagt 240 

tctcttgctg gatttactga ctgtcacaga acttccattc ctgttcattc aaaacaagaa 300 

aaaatcagta gaaggccatc taccgaagac actcatgaag tagattccaa agcagcttta 360 

ataccggttt gtagatttca actaaacaga tatatattat taaatacatt aaactttttt 420 

agataagatc tacaaagtgg tgatatttgg gactatatca aaaattcaaa aaaatttttc 480 

ttaagaaaac tgactttagc atagtagcag ttacagaaaa gtttcttaca gtgaatagtc 540 

aggaatttta aagaaaaatt tatgcagaat aaaggcagga atctcttttt gtttgaattg 600 

aagctaatta tatgaactca tttccagcta actgcgataa tgattgattt tgcaaattcc 660 

ctttaaaagc acacactgac aagacaaaaa gctcaggaaa aggcagaaaa attactcctt 720 

tataatcaag tattatatat aagtcagtgc tcataatttt gctcaagaaa atattgactt 780 

acattcatat atatctgttc tggcatagag agattatgtt gttaaaatca tgttattgaa 84 0 

aaaagttatt tcagtgggga aagaggttag ttaacaaaga gattcacagt aacaaatcct 900 

cctttctgga gggactcttc ctgaccctga gctgcacaac tttgcaacaa attaaagcct 960 

aaccgaagat gacctcacaa tggcaattta gaactcatgg gagtcaactt acataaacgg 1020 

tatttgattt ctgataagat agtggaatta ttggttatag atgacaaaat aagtatgttt 1080 

aaagtgatga tggacataaa aaagttttaa atataaaaca tgagaaaaga aggagatact 1140 

attcaaaaag actggcaaat ttgaaaaact agaaataaaa aaaaaaaaaa aaaatgagcg 1200 

gccgcaagct tt 1212 



<210> 2 

<211> 141 

<212> PRT 

<213> Homo Sapiens 

<400> 2 

Ala Arg Gly Lys Phe Phe Val Ser Thr Pro Ser Gin Pro Ser Cys Lys 

15 10 15 

Ser His Leu Glu Thr Gly Thr Lys Asp Gin Glu Thr Val Cys Ser Lys 
20 25 30 
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35 


Glv 


Asp 


Asn 


Thr 


Giu 
40 


Phe 


Gly 


Ala 


Phe 


Val 
45 


Gly 


Kis 


Arg 


Asp 


Ser 


Met 


Asp 


Leu 


Gin 


Arg 


Phe 


Lys 


Glu 


Thr 


Ser 


Asn 


Lys 


lie 


Lys 


50 










55 










60 










He 


Leu 


Ser 


Asn 


Asn 


Asn 


Thr 


Ser 


Glu 


Asn 


Thr 


Leu 


Lys 


Arg 


Val 


Ser 


65 










70 










75 










80 


Ser 


Leu 


Ala 


Gly 


Phe 
85 


Thr 


Asp 


Cys 


His 


Arg 
90 


Thr 


Ser 


lie 


Pro 


Val 
95 


His 


Ser 


Lys 


Gin 


Glu 


Lys 


He 


Ser 


Arg 


Arg 


Pro 


Ser 


Thr 


Glu 


Asp 


Thr 


His 






100 










105 










110 






Glu 


Val 


Asp 
115 


Ser 


Lys 


Ala 


Ala 


Leu 
120 


lie 


Pro 


Val 


Cys 


Arg 
125 


Phe 


Gin 


Leu 


Asn 


Arg 


Tyr 


He 


Leu 


Leu 


Asn 


Thr 


Leu 


Asn 


Phe 


Phe 


Arg 









130 135 140 



<210> 3 
<211> 739 
<212> DNA 

<213> Homo Sapiens 

<220> 

<221> unsure 
<222> (5) ... (5) 
<223> UNKNOWN 

<221> unsure 
<222> (21) . . . (22) 
<223> UNKNOWN 

<221> unsure 
<222> (29) . . . (29) 
<223> UNKNOWN 

<400> 3 
tcgantaggg gtcttccacc 
aattgatgtg tgtgcaaacg 
gactccattt cttcaagcag 
tattgcattt ytcaacaatg 
gtaccagcgt tatcatttta 
tatcattctt agccatatag 
taagacttcc gatggaccaa 
ttttgaagag cagtgtgttg 
gagtgaagag agaattcgtg 
agaagttgga gatccgtgtc 
ttggccattt gcaagatctt 
aaaagcgtcg gaagctagca 
acacttggct caaaacctt 

<210> 4 

<211> 235 

<212> PRT 

<213> Homo Sapiens 

<220> 

<221> UNSURE 
<222> (41) . . . (41) 
<223> UNKNOWN 

<221> UNSURE 
<222> (54) . . . (54) 



nncatactng gatgatggtt ggtgaagtct atgcatacga 60 

attctgttat ccctcaaatc tgtggtcctg ggacgtggtt 120 

tctacctctt tgwacagtat atcattatgg ttaatcttct 180 

tgtatttaca agtgaaggca atttccaata ttgyatggaa 240 

ttatggctta tcatgagaaa ccagttctgc ctcctccact 300 

tttctctgtt ttgctgcata tgtaagagaa gaaagaaaga 360 

aacttttctt aacagaagaa gatcaaaaga aacttcatga 420 

aaatgtattt caatgaaaaa gatgacaaat ttcattctgg 480 

tcacttttga aagagtggaa cagatgtgca ttcagattaa 540 

aactacataa aaagatcatt acaatcatta gattctcaaa 600 

tcagccctga cggtagatac attaaaaaca ctcactggcc 660 

aagttcataa tgaaatcaca cgagaactga gcatttccaa 720 

739 
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<221> UNSURE 
<222> (68) . . . (68) 
<223> UNKNOWN 



<400> 4 



Met 


Met 


Val 


Gly 


Glu 


Val 


Tyr 


Ala 


Tyr 


Glu 


He 


Asp 


Val 


Cys 


Ala 


Asn 


1 








5 










10 










15 




Asp 


Ser 


Val 


He 


Pro 


Gin 


He 


Cys 


Gly 


Pro 


Gly 


Thr 


Trp 


Leu 


Thr 


Pro 






20 










25 










30 






Phe 


Leu 


Gin 


Ala 


Val 


Tyr 


Leu 


Phe 


Xaa 


Gin 


Tyr 


He 


He 


Met 


Val 


Asn 






35 










40 










45 








Leu 


Leu 


lie 


Ala 


Phe 


Xaa 


Asn 


Asn 


Val 


Tyr 


Leu 


Gin 


Val 


Lys 


Ala 


He 




50 










55 










60 










Ser 


Asn 


lie 


Xaa 


Trp 


Lys 


Tyr 


Gin 


Arg 


Tyr 


His 


Phe 


He 


Met 


Ala 


Tyr 


65 










70 










75 










80 


His 


Glu 


Lys 


Pro 


Val 


Leu 


Pro 


Pro 


Pro 


Leu 


He 


He 


Leu 


Ser 


His 


T 1 a 

1 ±e 








85 










90 










95 




Val 


Ser 


Leu 


Phe 


Cys 


Cys 


He 


Cys 


Lys 


Arg 


Arg 


Lys 


Lys 


Asp 


Lys 


Thr 








100 










105 










110 






Ser 


Asp 


Gly 


Pro 


Lys 


Leu 


Phe 


Leu 


Thr 


Glu 


Glu 


Asp 


Gin 


Lys 


Lys 


Leu 






115 










120 










125 








His 


Asp 


Phe 


Glu 


Glu 


Gin 


Cys 


Val 


Glu 


Met 


Tyr 


Phe 


Asn 


Glu 


Lys 


Asp 




130 










135 










140 










Asp 


Lys 


Phe 


His 


Ser 


Gly 


Ser 


Glu 


Glu 


Arg 


He 


Arg 


Val 


Thr 


Phe 


Glu 


145 








150 










155 










160 


Arg 


Val 


Glu 


Gin 


Met 


Cys 


He 


Gin 


He 


Lys 


Glu 


Val 


Gly Asp 


Pro 


Cys 








165 








170 










175 




Gin 


Leu 


His 


Lys 


Lys 


He 


He 


Thr 


He 


He 


Arg 


Phe 


Ser 


Asn 


Trp 


Pro 








180 








185 










190 






Phe 


Ala 


Arg 


Ser 


Phe 


Ser 


Pro 


Asp 


Gly 


TVrg 


Tyr 


He 


Lys 


Asn 


Thr 


His 






195 










200 










205 








Trp 


Pro 


Lys 


Ala 


Ser 


Glu 


Ala 


Ser 


Lys 


Val 


His 


Asn 


Glu 


He 


Thr 


Arg 


210 










215 










220 










Glu 


Leu 


Ser 


He 


Ser 


Lys 


His 


Leu 


Ala 


Gin 


Asn 













225 230 235 



<210> 5 

<211> 1579 

<212> DNA 

<213> Homo Sapiens 

<220> 

<221> unsure 

<222> (368) . . . (368) 

<223> g or c 



<221> unsure 

<222> (372) . . . (372) 

<223> g or c 



<221> unsure 

<222> (374) . . . (374) 

<223> g or a 



<221> unsure 
<222> (375).. 
<223> g or c 



(375) 
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<221> unsure 

<222> (387) . . . (387) 

<221> unsure 

<222> (482) . . . (482) 



<400> 5 

acgtcgcctg caggtaccgg tccggaattc ccgggtcgac ccacgcgtcc ggcatggtgt 60 

tgtaaataca cttagctcct ctcttcctca aggtgatctt gaaagtaata atccrtttca 120 

ttgtaatatt ttaatgaaag atgacaaaga tccccagtgt aatatatttg gtcaagactt 180 

acctgcagta ccccagagaa aagaatttaa ttttccagag gctggttcct cttctggtgc 240 

cttattccca agtgctgttt cccctccaga actgcgacag agactacatg gggtagaact 300 

cttaaaaata tttaataaaa atcaaaaatt aggcagttca tctactagca taccacatct 360 

gtcatccsca csarscaaat tttttgntag tacaccatct cagccaagtt gcaaaagcca 420 

cttggaaact ggaaccaaag atcaagaaac tgtttgctct aaagctacag aaggagataa 480 

tncagaattt ggagcatttg taggacacag agatagcatg gatttacaga ggtttaaaga 540 

aacatcaaac aagataaaaa tactatccaa taacaatact tctgaaaaca ctttgaaacg 600 

agtgagttct cttgctggat ttactgactg tcacagaact tccattcctg ttcattcaaa 660 

acaagaaaaa atcagtagaa ggccatctac cgaagacact catgaagtag attccaaagc 720 

agctttaata ccggtttgta gatttcaact aaacagatat atattattaa atacattaaa 780 

cttttttaga taagatctac aaagtggtga tatttgggac tatatcaaaa attcaaaaaa 840 

atttttctta agaaaactga ctttagcata gtagcagtta cagaaaagtt tcttacagtg 900 

aatagtcagg aattttaaag aaaaatttat gcagaataaa ggcaggaatc tctttttgtt 960 

tgaattgaag ctaattatat gaactcattt ccagctaact gcgataatga ttgattttgc 1020 

aaattccctt taaaagcaca cactgacaag acaaaaagct caggaaaagg cagaaaaatt 1080 

actcctttat aatcaagtat tatatataag tcagtgctca taattttgct caagaaaata 1140 

ttgacttaca ttcatatata tctgttctgg catagagaga ttatgttgtt aaaatcatgt 1200 

tattgaaaaa agttatttca gtggggaaag aggttagtta acaaagagat tcacagtaac 12 60 

aaatcctcct ttctggaggg actcttcctg accctgagct gcacaacttt gcaacaaatt 1320 

aaagcctaac cgaagatgac ctcacaatgg caatttagaa ctcatgggag tcaacttaca 1380 

taaacggtat ttgatttctg ataagatagt ggaattattg gttatagatg acaaaataag 14 4 0 

tatgtttaaa gtgatgatgg acataaaaaa gttttaaata taaaacatga gaaaagaagg 1500 

agatactatt caaaaagact ggcaaatttg aaaaactaga aataaaaaaa aaaaaaaaaa 1560 

atgagcggcc gcaagcttt 1579 



<210> 6 

<211> 243 

<212> PRT 

<213> Homo Sapiens 

<220> 

<221> UNSURE 

<222> (103) . . . (105) 

<223> UNKNOWN 

<221> UNSURE 

<222> (109) . . . (109) 

<223> UNKNOWN 

<221> UNSURE 

<222> (141) . . . (141) 

<223> UNKNOWN 

<400> 6 

Val Asn Thr Leu Ser Ser Ser Leu Pro Gin Gly 

15 10 

Asn Pro Phe His Cys Asn lie Leu Met Lys Asp 

20 25 

Cys Asn lie Phe Gly Gin Asp Leu Pro Ala Val 
35 4 0 



Asp Leu Glu Ser Asn 
15 

Asp Lys Asp Pro Gin 
30 

Pro Gin Arg Lys Glu 
45 



wo 00/40614 



PCT/US99/29996 



Phe 


Asn 


Phe 


Pro 




50 






Ala 


Val 


Ser 


Pro 


65 








Leu 


Lys 


He 


Phe 


He 


Pro 


His 


Leu 








100 


Ser 


Gin 


Pro 


Ser 






115 




Glu 


Thr 


Val 


Cys 




130 






Ala 


Phe 


Val 


Gly 


145 








Thr 


Ser 


Asn 


Lys 


Thr 


Leu 


Lys 


Arg 








180 


Thr 


Ser 


He 


Pro 






195 




Ser 


Thr 


Glu 


Asp 




210 






Val 


Cys 


Arg 


Phe 


225 








Phe 


Phe 


Arg 





Glu Ala Gly Ser 
55 

Pro Glu Leu Arg 
70 

Asn Lys Asn Gin 

85 

Ser Ser Xaa Xaa 

Cys Lys Ser His 
120 

Ser Lys Ala Thr 
135 

His Arg Asp Ser 
150 

He Lys He Leu 
165 

Val Ser Ser Leu 

Val His Ser Lys 
200 

Thr His Glu Val 
215 

Gin Leu Asn Arg 
230 



-5- 

Ser Ser Gly Ala 
60 

Gin Arg Leu His 
75 

Lys Leu Gly Ser 
90 

Xaa Lys Phe Phe 
105 

Leu Glu Thr Gly 

Glu Gly Asp Asn 
140 

Met Asp Leu Gin 
155 

Ser Asn Asn Asn 
170 

Ala Gly Phe Thr 
185 

Gin Glu Lys He 

Asp Ser Lys Ala 
220 

Tyr He Leu Leu 
235 



Leu 


Phe 


Pro 


Ser 


Gly 


Val 


Glu 


Leu 








80 


Ser 


Ser 


Thr 


Ser 






95 




Xaa 


Ser 


Thr 


Pro 




110 






Thr 


Lys 


Asp 


Gin 


1 5^ 








Xaa 


Glu 


Phe 


Gly 


Arg 


Phe 


Lys 


Glu 








160 


Thr 


Ser 


Glu 


Asn 






175 




Asp 


Cys 


His 


Arg 




190 






Ser 


Arg 


Arg 


Pro 


205 








Ala 


Leu 


He 


Pro 


Asn 


Thr 


Leu 


Asn 








240 



<210> 7 

<211> 3532 

<212> DNA 

<213> Mus Musculus 

<220> 

<221> unsure 

<222> (2420) . . . (2420) 

<223> unknown 

<221> unsure 

<222> (2434) . . . (2434) 

<223> unknown 

<221> unsure 

<222> (2461) . . . (2461) 

<223> unknown 

<221> unsure 

<222> (2466) . . . (2466) 

<223> unknown 

<221> unsure 

<222> (2470) . . . (2470) 

<223> unknown 



<400> 7 

attatggctt atcatgaaaa accagtcctg cctcctcctc ttatcatcct cagccatata 60 

gtttcactgt tttgctgtgt atgcaaaaga agaaagaaag ataagacttc cgatgggcca 120 

aaacttttct taacagaaga agatcaaaag aaactccatg attttgaaga gcagtgtgtt 180 

gagatgtact ttgatgagaa agatgacaaa ttcaattctg ggagtgaaga gagaatccgg 240 

gtcacttttg aaagagtgga gcagatgagc attcagatta aagaagttgg agatcgtgtc 300 

aactacataa aaagatcatt acagtcttta gattctcaaa ttggtcatct gcaagatctc 360 
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tcagccctaa cagtagatac attgaaaaca cttacagccc agaaagcttc agaagctagt 4 20 

aaagtgcaca atgagatcac acgagaati:g agtatttcca aacacttggc tcagaatctt 4 80 

attgatgatg ttcctgtaag acctttgtgg gaagaaccta gtgctgtaaa cacactgagt 540 

tcctctcttc ctcaaggtga tcgggaaagt aataatcctt ttctttgtaa tatttttatg 600 

aaagatgaaa aagaccccca atataatctg trtggacaag atttgcccgt gataccccag 660 

agaaaagaat tcaacattcc agaggctggt tcctcctgtg gtgccttatt cccaagtgct 720 

gtttctcccc cagaattacg acagagacga catggggtag aaatgttaaa aatatttaat 780 

aaaaatcaaa aattaggcag ttcacctaat agttcaccac atatgtcctc cccaccaacc 840 

aaattttctg tgagtacccc atcccagcca agttgcaaaa gtcacttgga atccacaacc 900 

aaagatcaag aacccatttt ctataaagct gcagaagggg ataacataga atttggagca 960 

tttgtgggac acagagatag tatggactta cagaggttta aagaaacatc aaacaaaata 1020 

agagaactgt tatctaatga tactcctgaa aacactctga aacatgtggg tgctgctgga 1080 

tatagtgaat gttgtaagac ttctacttct ctrcactcgg tgcaagcaga aagctgtagt 1140 

agaagagcgt cgacggaaga ctctccagaa gtcgattcta aagcagcttt gttaccggat 1200 

tggttacgag atagaccatc aaacagagaa atgccatctg aaggaggaac attaaatggt 1260 

cttgcttctc catttaagcc cgttttggat acaaattact attattcagc tgtggaaaga 1320 

aataacctga tgaggttgtc acagagtatt cccttcgttc ctgtacctcc acgaggcgag 1380 

cctgtcacag tgtaccgtct ggaggagagt tctcccagta tactgaataa cagcatgtct 144*0 

tcatggtctc agctaggcct ctgtgccaaa atrgagtttt taagtaaaga ggaaatggaa 1500 

ggtggtttac gaagagcagt caaagtgctg tgracctggt cagagcacga tatcctgaag 1560 

tcagggcatc tcratatcat taagtcattt ctrcctgagg tgataaacac atggtcaagc 1620 

atttataaag aagatacggt tctacatctc tgtctcagag aaatacaaca acagagagca 1680 

gcacaaaagc tcacatttgc ctttaatcag atgaaaccca aatccatacc atattctcca 1740 

aggttccttg aagtrttcct gttgtactgc cattcagcag ggcagtggtt tgctgtagaa 1800 

gagtgcatga ctggtgaatt tagaaaatac aacaacaata atggtgatga aatcattcct 18 60 

acaaatactc tagaagagat catgctagcc tttagccact ggacctatga atataccaga 1920 

ggggagttac tggtacttga cttacaagga gtgggagaaa acttgactga cccatctgta 1980 

ataaaagctg aagaaaaaag atcctgtgac atggtttttg gccctgccaa tctaggagaa 204 0 

gatgcaataa aaaacttcaa gagccaaaca tccactgtaa ttcttgctgt cgaaagctta 2100 

aacttcccag atttgaagag gaatgactac acgcccttga taaaattata tttcctcagg 2160 

atgagtcatc agatttgaat cttcaatctg gaaattccac caaagaatca gaagcaacaa 2220 

attctgttcg tctgatgtta tagtgctgag tcattggttt ttgcctacac ttcacaaaag 2280 

tgtaactgtc agttttcctt tcgggggaat tgatgatata ggaagatgtg tgcaaaatga 234 0 

gcttgctggc cccacacata gtctagaggt aatgttctca ttgaaaaacg cctggaggtg 2400 

gaggctgcag atgccagtgn aaagtgctag ctgncagaga gtcagtgctc tcgggctggt 24 60 

naaggncggn acccttgctg ctgagagtgg tggttctctt cacctggtgc aggaccatta 2520 

accaaagtca agtcttcaga tttgattggc tgctcagtca cagcccattc agctaaggaa 2580 

actaaattgc gcagcttttt aaatggctga agtcttcctc agtttgtgct ctatgataat 2 640 

gatgttagct ctcaactagg tgtttgtggc cacgggagaa ctactcctta caattttgct 2700 

tcacaggcat gttacaaagc ctgcactgaa aaccgtttgt cttccctctc tccctccctc 27 60 

ttttccctgt agtattgagg atcaaaccca gggcctcatg aagaccattt tctaagagac 2820 

attttattta agaatcaact atagagtcta tgtttatgga tacagccagt ttttgttaaa 2880 

caaaacctga attgtgcaaa agggtttttt aacatttatc aatgttaagt aaaagaaagc 2940 

catgataaat aagaattaac tcactgttca atgggtgttt cctgtgagga aggttacagt 3000 

tgtaacagcc tgcagttgca tacatctcca aagatttaca gacttagtgt atcaaatcag 3060 

agtgtcatgt gagctctcac attgaaaatt ctataggaat gtgtcaatgt gaattctatt 3120 

tctggractt aagaaatcag ttgttggatt atccttatac agtataggga gatcacaata 3180 

caactttatg ccaataaaat ctaacttaat tgcccagata tttttgcata tttagcaaca 3240 

agaaaagctt atcatttgac tcaagtttta tgctttctct ttcttttcat ttcctaggta 3300 

ctaattttaa tttttatttg gaaggagcag tgtaaagctt acttgtattc aatagtgtat 3360 

ctcatagata cagacaaggc cgcagagata agctgttaaa tagtgtttaa tgttgatgtg 3420 

gagagaaagg tgtattactt aaaaatacta taccatatac gttttgtata tcattaaatc 3480 

tttaaaagaa attaaattta ttcttgttta aaaaaaaaaa aaaaaaaaaa aa 3532 

<210> 8 

<211> 475 

<212> PRT 

<213> Mus Musculus 



<400> 8 
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lie 


Met 


Ala 


Tyr 


His 


Glu 


Lys 


Pro 


Val 


Leu 


Pro 


Pro 


Pro 


Leu 


He 


He 


1 








5 










10 










15 




Leu 


Ser 


His 


He 


Val 


Ser 


Leu 


Phe 


Cys 


Cys 


Val 


Cys 


Lys 


Arg 


Arg 


Lys 








20 










25 










30 






Lys 


Asp 


Lys 


Thr 


Ser 


Asp 


Gly 


Pro 


Lys 


Leu 


Phe 


Leu 


Thr 


Glu 


Glu 


Asp 






35 










40 










45 








Gin 


Lys 


Lys 


Leu 


His 


Asp 


Phe 


Glu 


Glu 


Gin 


Cys 


Val 


Glu 


Met 


Tyr 


Phe 




50 










55 










60 










Asp 


Glu 


Lys 


Asp Asp 


Lys 


Phe 


Asn 


Ser 


Gly 


Ser 


Glu 


Glu 


Arg 


He 


Arg 


65 










70 










75 










80 


Val 


Thr 


Phe 


Glu 


Arg 


Val 


Glu 


Gin 


Met 


Ser 


He 


Gin 


He 


Lys 


Glu 


Val 










85 










90 










95 




Gly Asp Arg 


Val 


Asn 


Tyr 


He 


Lys 


Arg 


Ser 


Leu 


Gin 


Ser 


Leu 


Asp 


Ser 








100 










105 










110 






Gin 


He 


Gly 


His 


Leu 


Gin 


Asp 


Leu 


Ser 


Ala 


Leu 


Thr 


Val 


Asp 


Thr 


Leu 






115 










120 










125 








Lys. Thr 


Leu 


Thr 


Ala 


Gin 


Lys 


Ala 


Ser 


Glu 


Ala 


Ser 


Lys 


Val 


His 


Asn 




130 










135 










140 










Glu 


He 


Thr 


Arg 


Glu 


Leu 


Ser 


He 


Ser 


Lys 


His 


Leu 


Ala 


Gin 


Asn 


Leu 


145 










150 










155 










160 


He 


Asp 


Asp 


Val 


Pro 


Val 


Arg 


Pro 


Leu 


Trp 


Glu 


Glu 


Pro 


Ser 


Ala 


Val 










165 










170 










175 




Asn 


Thr 


Leu 


Ser 


Ser 


Ser 


Leu 


Pro 


Gin 


Gly Asp 


Arg 


Glu 


Ser 


Asn 


Asn 








180 










185 










190 






Pro 


Phe 


Leu 


Cys 


Asn 


He 


Phe 


Met 


Lys 


Asp 


Glu 


Lys 


Asp 


Pro 


Gin 


Tyr 






195 










200 










205 








Asn 


Leu 


Phe 


Gly Gin 


Asp 


Leu 


Pro 


Val 


He 


Pro 


Gin 


Arg 


Lys 


Glu 


Phe 




210 










215 










220 










Asn 


He 


Pro 


Glu 


Ala 


Gly 


Ser 


Ser 


Cys 


Gly Ala 


Leu 


Phe 


Pro 


Ser 


Ala 


225 










230 










235 










240 


Val 


Ser 


Pro 


Pro 


Glu 


Leu 


Arg 


Gin 


Arg 


Arg 


His 


Gly 


Val 


Glu 


Met 


Leu 










245 










250 










255 




Lys 


He 


Phe 


Asn 


Lys 


Asn 


Gin 


Lys 


Leu 


Gly 


Ser 


Ser 


Pro 


Asn 


Ser 


Ser 








260 










265 










270 






Pro 


His 


Met 


Ser 


Ser 


Pro 


Pro 


Thr 


Lys 


Phe 


Ser 


Val 


Ser 


Thr 


Pro 


Ser 






275 










280 










285 








Gin 


Pro 


Ser 


Cys 


Lys 


Ser 


His 


Leu 


Glu 


Ser 


Thr 


Thr 


Lys 


Asp 


Gin 


Glu 




290 










295 










300 










Pro 


He 


Phe 


Tyr 


Lys 


Ala 


Ala 


Glu 


Gly 


Asp 


Asn 


He 


Glu 


Phe 


Gly 


Ala 


305 










310 










315 










320 


Phe 


Val 


Gly 


His 


Arg 


Asp 


Ser 


Met 


Asp 


Leu 


Gin 


Arg 


Phe 


Lys 


Glu 


Thr 










325 










330 










335 




Ser 


Asn 


Lys 


He 


Arg 


Glu 


Leu 


Leu 


Ser 


Asn 


Asp 


Thr 


Pro 


Glu 


Asn 


Thr 








340 










345 










350 






Leu 


Lys 


His 


Val 


Gly 


Ala 


Ala 


Gly 


Tyr 


Ser 


Glu 


Cys 


Cys 


Lys 


Thr 


Ser 






355 










360 










365 








Thr 


Ser 


Leu 


His 


Ser 


Val 


Gin 


Ala 


Glu 


Ser 


Cys 


Ser 


Arg 


Arg 


Ala 


Ser 




370 










375 










380 










Thr 


Glu 


Asp 


Ser 


Pro 


Glu 


Val 


Asp 


Ser 


Lys 


Ala 


Ala 


Leu 


Leu 


Pro 


Asp 


385 










390 










395 










400 


Trp 


Leu 


Arg 


Asp 


Arg 


Pro 


Ser 


Asn 


Arg 


Glu 


Met 


Pro 


Ser 


Glu 


Gly 


Gly 










405 










410 










415 




Thr 


Leu 


Asn 


Gly 


Leu 


Ala 


Ser 


Pro 


Phe 


Lys 


Pro 


Val 


Leu 


Asp 


Thr 


Asn 








420 










425 










430 






Tyr 


Tyr 


Tyr 


Ser 


Ala 


Val 


Glu Arg Asn 


Asn 


Leu 


Met 


Arg 


Leu 


Ser 


Gin 






435 










440 










445 








Ser 


He 


Pro 


Phe 


Val 


Pro 


Val 


Pro 


Pro 


Arg 


Gly Glu 


Pro 


Val 


Thr 


Val 




450 










4 55 










4 60 










Tyr 


Pro 


Ser 


Gly 


Gly Arg 


Val 


Leu 


Pro 


Val 


Tyr 












465 










470 










475 
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<211> 5433 

<212> DNA 

<213> Mus Musculus 

<220> 

<221> unsure 

<222> (5094) . . . (5094) 

<223> unknown 



<4 00> 9 

ggctgaaaga gcctgagctg tgcctctcca ttccactgct gtggcagggt cagaaatctt 60 

ggatagagaa aaccttttgc aaacgggaat gtatctttgt aattcctagc acgaaagact 120 

ctaacaggtg ttgctgtggc cagttcacca accagcatat cccccctctg ccaagtgcaa 180 

cacccagcaa aaatgaagag gaaagcaaac aggtggagac tcagcctgag aaatggtctg 240 

ttgccaagca cacccagagc tacccaacag attcctatgg agttcttgaa ttccagggtg 300 

gcggatattc caataaagcc atgtatatcc gtgtatccta tgacaccaag ccagactcac 360 

tgctccatct catggtgaaa gattggcagc tggaactccc caagctctta atatctgtgc 420 

atggaggcct ccagaacttt gagatgcagc ccaagctgaa acaagtcttt gggaaaggcc 4 80 

tgatcaaggc tgctatgacc accggggcct ggatcttcac cgggggtgtc agcacaggtg 540 

ttatcagcca cgtaggggat gccttgaaag accactcctc caagtccaga ggccgggttt 600 

gtgctatagg aattgctcca tggggcatcg tggagaataa ggaagacctg gttggaaagg 660 

atgtaacaag agtgtaccag accatgtcca accctctaag taagctctct gtgctcaaca 720 

actcccacac ccacttcatc ctggctgaca atggcaccct gggcaagtat ggcgccgagg 780 

tgaagctgcg aaggctgctg gaaaagcaca tctccctcca gaagatcaac acaagactgg 840 

ggcagggcgt gcccctcgtg ggtctcgtgg tggagggggg ccctaacgtg gtgtccatcg 900 

tcttggaata cctgcaagaa gagcctccca tccctgtggt gatttgtgat ggcagcggac 960 

gtgcctcgga catcctgtcc tttgcgcaca agtactgtga agaaggcgga ataataaatg 1020 

agtccctcag ggagcagctt ctagttacca ttcagaaaac atttaattat aataaggcac 1080 

aatcacatca gctgtttgca attataatgg agtgcatgaa gaagaaagaa ctcgtcactg 114 0 

tgttcagaat gggttctgag ggccagcagg acatcgagat ggcaatttta actgccctgc 1200 

tgaaaggaac aaacgtatct gctccagatc agctgagctt ggcactggct tggaaccgcg 12 60 

tggacatagc acgaagccag atctttgtct ttgggcccca ctggacgccc ctgggaagcc 1320 

tggcaccccc gacggacagc aaagccacgg agaaggagaa gaagccaccc atggccacca 1380 

ccaagggagg aagaggaaaa gggaaaggca agaagaaagg gaaagtgaaa gaggaagtgg 1440 

aggaagaaac tgacccccgg aagatagagc tgctgaactg ggtgaatgct ttggagcaag 1500 

cgatgctaga tgctttagtc ttagatcgtg tcgactttgt gaagctcctg attgaaaacg 1560 

gagtgaacat gcaacacttt ctgaccattc cgaggctgga ggagctctat aacacaagac 1620 

tgggtccacc aaacacactt catctgctgg tgagggatgt gaaaaagagc aaccttccgc 1680 

ctgattacca catcagcctc atagacatcg ggctcgtgct ggagtacctc atgggaggag 1740 

cctaccgctg caactacact cggaaaaact ttcggaccct ttacaacaac ttgtttggac 1800 

caaagaggcc taaagctctt aaacttctgg gaatggaaga tgatgagcct ccagctaaag 18 60 

ggaagaaaaa aaaaaaaaag aaaaaggagg aagagatcga cattgatgtg gacgaccctg 1920 

ccgtgagtcg gttccagtat cccttccacg agctgatggt gtgggcagtg ctgatgaaac 1980 

gccagaaaat ggcagtgttc ctctggcagc gaggggaaga gagcatggcc aaggccctgg 2040 

tggcctgcaa gctctacaag gccatggccc acgagtcctc cgagagtgat ctggtggatg 2100 

acatctccca ggacttggat aacaattcca aagacttcgg ccagcttgct ttggagttat 2160 

tagaccagtc ctataagcat gacgagcaga tcgctatgaa actcctgacc tacgagctga 2220 

aaaactggag caactcgacc tgcctcaaac tggccgtggc agccaaacac cgggacttca 2280 

ttgctcacac ctgcagccag atgctgctga ccgatatgtg gatgggaaga ctgcggatgc 2340 

ggaagaaccc cggcctgaag gttatcatgg ggattcttct accccccacc atcttgtttt 2400 

tggaatttcg cacatatgat gatttctcgt atcaaacatc caaggaaaac gaggatggca 24 60 

aagaaaaaga agaggaaaat acggatgcaa atgcagatgc tggctcaaga aagggggatg 2520 

aggagaacga gcataaaaaa cagagaagta ttcccatcgg aacaaagatc tgtgaattct 2580 

ataacgcgcc cattgtcaag ttctggtttt acacaatatc atacttgggc tacctgctgc 2640 

tgtttaacta cgtcatcctg gtgcggatgg atggctggcc gtccctccag gagtggatcg 2700 

tcatctccta catcgtgagc ctggcgttag agaagatacg agagatcctc atgtcagaac 2760 

caggcaaact cagccagaaa atcaaagttt ggcttcagga gtactggaac atcacagatc 2820 

tcgtggccat ttccacattc atgattggag caattcttcg cctacagaac cagccctaca 2880 
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tgggctatgg ccgggtgatc tactgtgtgg atatcatctt ctggtacatc cgtgtcctgg 2 94 0 

acatctttgg tgtcaacaag tatctggggc catacgtgat gatgattgga aagargatga 3000 

tcgacatgct gtactttgtg gtcatcatgc tggtcgtgct catgagtttc ggagtagccc 3060 

gtcaagccat tctgcatcca gaggagaagc cctcttggaa actggcccga aacatcttct 3120 

acatgcccta ctggatgatc tatggagagg tgtttgcaga ccagatagac ctctacgcca 3180 
tggaaattaa tcctccttgt ggtgagaacc tatatgatga ggagggcaag cggcttcctc 3240 
cctgtatccc cggcgcctgg ctcactccag cactcatggc gtgctatcta ctggtcgcca 3300 

acatcctgct ggtgaacctg ctgattgctg tgttcaacaa tactttcttt gaagtaaaat 3360 

caatatccaa ccaggtgtgg aagttccagc gatatcagct gattatgaca tttcatgaca 3420 
ggccagtcct gcccccaccg atgatcattt taagccacat ctacatcatc attatgcgtc 3480 
tcagcggccg ctgcaggaaa aagagagaag gggaccaaga ggaacgggat cgtggattga 3540 

agctcttcct tagcgacgag gagctaaaga ggctgcatga gttcgaggag cagtgcgtgc 3600 

aggagcactt ccgggagaag gaggatgagc agcagtcgtc cagcgacgag cgcatccggg 3660 

tcacttctga aagagttgaa aatatgtcaa tgaggttgga agaaatcaat gaaagagaaa 3720 

cttttatgaa aacttccctg cagactgttg accttcgact tgctcagcta gaagaattat 3780 

ctaacagaat ggtgaatgct cttgaaaatc ttgcgggaat cgacaggtct gacctgatcc 3840 

aggcacggtc ccgggcttct tctgaatgtg aggcaacgta tcttctccgg caaagcagca 3900 

tcaatagcgc tgatggctac agcttgtatc gatatcattt taacggagaa gagttattat 3960 

ttgaggatac atctctctcc acgtcaccag ggacaggagt caggaaaaaa acctgttcct 4020 

tccgtataaa ggaagagaag gacgtgaaaa cgcacctagt cccagaatgt cagaacagtc 4 080 

ttcacctttc actgggcaca agcacatcag caaccccaga tggcagtcac cttgcagtag 414 0 

atgacttaaa gaacgctgaa gagtcaaaat taggtccaga tattgggatt tcaaaggaag 4 200 

atgatgaaag acagacagac tctaaaaaag aagaaactat ttccccaagt ttaaataaaa 4 2 60 

cagatgtgat acatggacag gacaaatcag atgttcaaaa cactcagcta acagtggaaa 4 320 

cgacaaatat agaaggcact atttcctatc ccctggaaga aaccaaaatt acacgctatt 4 380 

tccccgatga aacgatcaat gcttgtaaaa caatgaagtc cagaagcttc gtctattccc 4 44 0 

ggggaagaaa gctggtcggt ggggttaacc aggatgtaga gtacagttca atcacggacc 4 500 

agcaattgac gacggaatgg caatgccaag ttcaaaagat cacgcgctct catagcacag 4 560 

atattcctta cattgtgtcg gaagctgcag tgcaagctga gcaaaaagag cagtttgcag 4 620 

atatgcaaga tgaacaccat gtcgctgaag caattcctcg aatccctcgc ttgtccctaa 4 680 

ccattactga cagaaatggg atggaaaact tactgtctgt gaagccagat caaactttgg 474 0 

gattcccatc tctcaggtca aaaagtttac atggacatcc taggaatgtg aaatccattc 4800 

agggaaagtt agacagatct ggacatgcca gtagtgtaag cagcttagta attgtgtctg 4860 

gaatgacagc agaagaaaaa aaggttaaga aagagaaagc ttccacagaa actgaatgct 4 920 

agtctgtttt gtttctttaa tttttttttt taacagtcag aaacccacta atgggtgtca 4 980 

tcttggccca tcctaaacac atmtccaatt tcctaaaaac attttccctt aaaaaatttt 5040 

ggaaattcag acttgattta caatttaatg cactaaaagt agtattttgt tagnatatgt 5100 

tagtaggctt agttttttca gttgcagtag tatcaaatga aagtgatgat actgtaacga 5160 

agataaattg gctaatcagt atacaagatt atacaatctc tttattactg agggccacca 5220 

aatagcctag gaagtgccct cgagcactga agtcaccatt aggtcactca agaagtaagc 5280 

aactagctgg gcacagtggc tcatgcctgt aatcctagca ctttgggagg ccaaggcaga 534 0 

aagatagctt gagtccagga gtttgagacc agcctgggca acatagtgat accccatctc 5400 

ttaaaaaaaa aaaaaaaaaa ctgccctcgt gcc 5433 
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Met Val 
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Val 


His Gly 
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Gin 
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Phe 




35 








40 


Val 


Phe Gly 


Lys Gly 


Leu 


He 


Lys 




50 






55 





Thr Lys Pro Asp Ser Leu Leu His 
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Glu Leu Pro Lys Leu Leu He Ser 
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He 


Phe 


Thr 


Gly 


Gly 


Val 


Ser 


Thr 


Gly 


val 


He 


Ser 


His 


Val 


Gly 


Asp 


65 










70 










75 










80 


Ala 


Leu 


Lys 


Asp 


His 


Ser 


Ser 


Lys 


Ser 


Arg 


Gly 


Arg 


Val 


Cys 


Ala 


He 








85 










90 










95 




Gly 


He 


Ala 


Pro 


Trp 


Gly 


He 


Val 


Glu 


Asn 


Lys 


Glu 


Asp 


Leu 


Val 


Gly 






100 










105 










110 






Lys 


Asp 


Val 


Thr 


Arg 


Val 


Tyr 


Gin 


Thr 


Met 


Ser 


Asn 


Pro 


Leu 


Ser 


Lys 


115 










120 










125 








Leu 


Ser 
130 


Val 


Leu 


Asn 


Asn 


Ser 
135 


His 


Thr 


His 


Phe 


He 
140 


Leu 


Ala 


Asp 


Asn 


Gly 


Thr 


Leu 


Gly Lys 


Tyr Gly Ala 


Glu 


Val 


Lys 


Leu 


Arg 


Arg 


Leu 


Leu 


145 




















155 
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Lys 


His 
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Leu 


f^l n 

OXIl 


Lys 


He 


Asn 


Thr 


Arg 


Leu 


Gly 


Gin 


Gly 








X 03 










170 










175 




Val 


Pro 


Leu 


V a X 
n on 

J.DU 


Gly 


Leu 


vox 


Val 


Glu 
185 


Gly 


Gly 


Pro 


Asn 


Val 
190 


Val 


Ser 


He 


Val 


Leu 
195 


Glu 


Tyr 


Leu 


Gin 


Glu 


Glu 


Pro 


Pro 


He 


Pro 
205 


Val 


Val 


He 


Cys 


Asp 
210 


Gly 


Ser 


Gly 


Arg 


AX a 
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C A V- 

oer 


Asp 


He 


Leu 


Ser 
220 


Phe 


Ala 


His 


Lys 


Tyr 


Cys 


Glu 


oXU 
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Gly 


xxe 


Tie 
X Xe 


Asn 


Glu 


Ser 


Leu Arg 


Glu 


Gin 


Leu 


225 








230 










235 










240 


Leu 


Val 


Thr 


He 


Gin 
245 


Lys 


Thr 


Phe 


Asn 


Tyr 
250 


Asn 


Lys 


Ala 


Gin 


Ser 
255 


His 


Gin 


Leu 


Phe 


AX a 


He 


He 




m n 

OX U 


Cys 
265 


Met 


Lys 


Lys 


Lys 


Glu 
270 


Leu 


Val 


Thr 


Val 


Phe 


Arg 


Met 


Gly 


Ser 


Glu 


Gly Gin 


Gin 


Asp 


He 


Glu 


Met 


Ala 






275 










280 










285 








He 


Leu 
290 


Thr 


AX a 


Leu 


Leu 


Lys 


vjxy 


Thr 


Asn 


Val 


Ser 
300 


Ala 


Pro 


Asp 


Gin 


Leu 


Ser 


Leu 


Ala 


Leu 


Ala 


Trp 


Asn 


Arg 


Val 


Asp 


He 


Ala 


Arg 


Ser 


Gin 


305 










310 










315 










320 


He 


Phe 


Val 


IT lie 


Gly 
325 


Pro 


His 


T m 
1 


Thr 


Pro 
330 


Leu 


Gly 


Ser 


Leu 


Ala 
335 


Pro 


Pro 


Thr 


Asp 


Ser 


Lys 


Ala 


Thr 


Glu 


Lys 


Glu 


Lys 


Lys 


Pro 


Pro 


Met 


Ala 
















345 










350 






Thr 


Thr 


Lys 


Gly 


Gly Arg 


Gly 


Lys 


Gly 


Lys 


Gly 


Lys 


Lys 


Lys 


Gly 


Lys 






355 










360 










365 








Val 


Lys 
370 


Glu 


Glu 


Val 


Glu 


Glu 
375 


Glu 


Thr 


Asp 


Pro 


Arg 
380 


Lys 


He 


Glu 


Leu 


Leu 


Asn 


Trp 


Val 


Asn 


Ala 


Leu 


Glu 


Gin 


Ala 


Met 


Leu 


Asp 


Ala 


Leu 


Val 


385 








390 










395 
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Leu 


Asp 


Arg 


Val 


Asp 


Phe 


Val 


Lys 


Leu 


Leu 


He 


Glu 


Asn 


Gly 


Val 


Asn 








405 










410 










415 




Met 


Gin 


His 


Phe 
420 


Leu 


Thr 


He 
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Arg 
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Leu 


Glu 
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Tyr 
430 


Asn 


Thr 


Arg 


Leu 


Gly 


Pro 


Pro 


Asn 


Thr 


Leu 


His 


Leu 


Leu 


Val 


Arg 


Asp 


Val 


Lys 




435 










440 










445 








Lys 


Ser 


Asn 


Leu 


Pro 


Pro 


Asp 


Tyr 


His 


He 


Ser 


Leu 


He 


Asp 


He 


Gly 


450 










455 










460 










Leu 


Val 
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Glu 


Tyr 


Leu 


Met 
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Gly 
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Tyr 


Arg 


Cys 


Asn 


Tyr 


Thr 


465 








470 
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Lys 


Asn 


Phe 


Arg 
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Thr 


Leu 
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Asn 


Asn 
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Lys 


Leu 
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Lys 
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He 
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He 
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Arg 


Phe 
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Pro 
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His 


Glu 
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Val 


Trp 


Ala 
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Leu 
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Met 
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545 










550 
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Lys 
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590 






Asp 
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He 


Ser 


Gin 
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880 


Tyr 


Leu 


Gly 


Pro 


Tyr 


Val 


Met 


Met 


He 


Gly 


Lys 


Met 


Met 


He 


Asp 


Met 










885 










890 










895 




Leu 


Tyr 


Phe 


Val 


Val 


He 


Met 


Leu 


Val 


Val 


Leu 


Met 


Ser 


Phe 


Gly 


Val 








900 










905 










910 






Ala 


Arg 


Gin 


Ala 


He 


Leu 


His 


Pro 


Glu 


Glu 


Lys 


Pro 


Ser 


Trp 


Lys 


Leu 






915 










920 










925 








Ala 


Arg 


Asn 


He 


Phe 


Tyr 


Met 


Pro 


Tyr 


Trp 


Met 


He 


Tyr 


Gly 


Glu 


Val 




930 










935 










940 










Phe 


Ala 


Asp 


Gin 


He 


Asp 


Leu 


Tyr 


Ala 


Met 


Glu 


He 


Asn 


Pro 


Pro 


Cys 


945 










950 










955 










960 


Gly 


Glu 


Asn 


Leu 


Tyr 


Asp 


Glu 


Glu 


Gly 


Lys 


Arg 


Leu 


Pro 


Pro 


Cys 


He 










965 










970 










975 




Pro 


Gly 


Ala 


Trp 


Leu 


Thr 


Pro 


Ala 


Leu 


Met 


Ala 


Cys 


Tyr 


Leu 


Leu 


Val 








980 










985 










990 






Ala 


Asn 


He 


Leu 


Leu 


Val 


Asn 


Leu 


Leu 


He 


Ala 


Val 


Phe 


Asn 


Asn 


Thr 






995 










1000 








1005 






Phe 


Phe 


Glu 


Val 


Lys 


Ser 


He 


Ser 


Asn 


Gin 


Val 


Trp 


Lys 


Phe 


Gin 


Arg 




1010 








1015 








1020 
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Tyr Gin Leu lie Met Thr Phe His Asp Arg Pro Val Leu Pro Pro Pro 
1025 1030 1035 104 

Met lie lie Leu Ser His lie Tyr lie lie lie Met Arg Leu Ser Gly 

1045 1050 1055 

Arg Cys Arg Lys Lys Arg Glu Gly Asp Gin Glu Glu Arg Asp Arg Gly 

1060 1065 1070 

Leu Lys Leu Phe Leu Ser Asp Glu Glu Leu Lys Arg Leu His Glu Phe 

1075 1080 1085 

Glu Glu Gin Cys Val Gin Glu His Phe Arg Glu Lys Glu Asp Glu Gin 

1090 1095 1100 

Gin Ser Ser Ser Asp Glu Arg lie Arg Val Thr Ser Glu Arg Val Glu 
1105 1110 1115 112 

Asn Met Ser Met Arg Leu Glu Glu lie Asn Glu Arg Glu Thr Phe Met 

1125 1130 1135 

Lys Thr Ser Leu Gin Thr Val Asp Leu Arg Leu Ala Gin Leu Glu Glu 

1140 1145 1150 

Leu Ser Asn Arg Met Val Asn Ala Leu Glu Asn Leu Ala Gly lie Asp 

1155 1160 1165 

Arg Ser Asp Leu lie Gin Ala Arg Ser Arg Ala Ser Ser Glu Cys Glu 

1170 1175 1180 

Ala Thr Tyr Leu Leu Arg Gin Ser Ser lie Asn Ser Ala Asp Gly Tyr 
1185 1190 1195 120 

Ser Leu Tyr Arg Tyr His Phe Asn Gly Glu Glu Leu Leu Phe Glu Asp 

1205 1210 1215 

Thr Ser Leu Ser Thr Ser Pro Gly Thr Gly Val Arg Lys Lys Thr Cys 

1220 1225 1230 

Ser Phe Arg lie Lys Glu Glu Lys Asp Val Lys Thr His Leu Val Pro 

1235 1240 1245 

Glu Cys Gin Asn Ser Leu His Leu Ser Leu Gly Thr Ser Thr Ser Ala 

1250 1255 1260 

Thr Pro Asp Gly Ser His Leu Ala Val Asp Asp Leu Lys Asn Ala Glu 
1265 1270 1275 128 

Glu Ser Lys Leu Gly Pro Asp lie Gly lie Ser Lys Glu Asp Asp Glu 

1285 1290 1295 

Arg Gin Thr Asp Ser Lys Lys Glu Glu Thr lie Ser Pro Ser Leu Asn 

1300 1305 1310 

Lys Thr Asp Val He His Gly Gin Asp Lys Ser Asp Val Gin Asn Thr 

1315 1320 1325 

Gin Leu Thr Val Glu Thr Thr Asn He Glu Gly Thr He Ser Tyr Pro 

1330 1335 1340 

Leu Glu Glu Thr Lys He Thr Arg Tyr Phe Pro Asp Glu Thr He Asn 
1345 1350 1355 136 

Ala Cys Lys Thr Met Lys Ser Arg Ser Phe Val Tyr Ser Arg Gly Arg 

1365 1370 1375 

Lys Leu Val Gly Gly Val Asn Gin Asp Val Glu Tyr Ser Ser He Thr 

1380 1385 1390 

Asp Gin Gin Leu Thr Thr Glu Trp Gin Cys Gin Val Gin Lys He Thr 

1395 1400 1405 

Arg Ser His Ser Thr Asp He Pro Tyr He Val Ser Glu Ala Ala Val 

1410 1415 1420 

Gin Ala Glu Gin Lys Glu Gin Phe Ala Asp Met Gin Asp Glu His His 
1425 1430 1435 144 

Val Ala Glu Ala He Pro Arg He Pro Arg Leu Ser Leu Thr He Thr 

1445 1450 1455 

Asp Arg Asn Gly Met Glu Asn Leu Leu Ser Val Lys Pro Asp Gin Thr 

1460 1465 1470 

Leu Gly Phe Pro Ser Leu Arg Ser Lys Ser Leu His Gly His Pro Arg 

1475 1480 1485 

Asn Val Lys Ser He Gin Gly Lys Leu Asp Arg Ser Gly His Ala Ser 
1490 1495 1500 
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Ser Val Ser Ser Leu Val He Val Ser Gly Met Thr Ala Glu Glu Lys 
1505 1510 1515 152 

Lys Val Lys Lys Glu Lys Ala Ser Thr Glu Thr Glu Cys 
1525 1530 

<210> 11 

<211> 6220 

<212> DNA 

<213> Homo Sapiens 



<400 
tgtgcagaat 
ttccctgtgc 
tgcagtcacc 
actcccccag 
gtgcaggctg 
ctctcagaac 
tctctagaac 
ttgcagctgg 
gcaggaggag 
tctccggcgc 
caatgacaag 
cgtgtatttt 
ctacacgcat 
gtgggaccca 
cacgggcctg 
cgtgatctac 
ggtgaccggg 
aggcctggtc 
cggcgtcatg 
ggaaggcgag 
cctgatccat 
gaacctgacc 
cggccagtac 
gaccaaggaa 
cggcccgggc 
tgtggagggc 
ggacatcact 
gaccttcacg 
gcggcagctg 
catcttgcag 
ggaccaccag 
cttcatggat 
catctccaac 
gtttgtcacc 
ccacagcaag 
ggcgccccgc 
cacgcagccg 
tccccacgtc 
aggccatgtg 
gaaccgtcgg 
cttggcctgc 
ggaggagatg 
gtgctaccgg 
ggggaagacc 
cgggggcatc 
gctgtggcgt 
cttcagggag 
caccgcaccc 
gttcgcctac 



> 11 

tgtacagttg 
ggtgctcagt 
tgaggttgtt 
agtgtccggg 
atgaagagaa 
atcagcctca 
cccagtgtag 
cgtgggggtc 
ggctttgagg 
agcaacagca 
caagaaagcc 
gtggaaagtt 
gagcagcact 
aagaaacatg 
agccagaagg 
cacctcatga 
ggggccaaga 
aaggtggctc 
aagcaggtag 
ctcatcacca 
cccacgggca 
tgcctagaca 
ggggtggaga 
agaggaggtg 
acgttgcaca 
tcgggccgcg 
atctccctga 
gaaagcagga 
ctgactgtct 
gccttgctga 
ctgaaactgg 
gagtggcagt 
aagcctgagt 
tgggacacct 
ctgcaaaagg 
ctgcagatgc 
ctttatcccc 
aagctcaacg 
accttcacca 
gagctggcag 
agcaagatcc 
ctggcgctgg 
aaggacgaag 
acctgcctgc 
caggccttcc 
gtgaccctgt 
aagaggctgc 
gtggtggtct 
gtgctcatgg 



cgaaaccatg 
gcatctgcac 
accattatga 
ggccccagcc 
ccggatgagg 
tcttcctgtc 
cgagctggag 
tcagaatgga 
ggctgcccag 
gcctcttcaa 
tcagttcgtg 
ccaaactgtc 
tggaggaggc 
tccaggagat 
tgaaaaagta 
cccagcactg 
acttcaacat 
agaccacagg 
gcgaggcggt 
tcggagtcgc 
gcttccccgc 
gcaaccactc 
ttcctctgag 
tggccatcaa 
ccatcgacaa 
tggccgacgt 
tccagcagaa 
ttgtcgagtg 
tccgggaagg 
aagcctcacg 
cagtggcatg 
ggaagccttc 
ttgtgaagct 
tgctctacct 
tgctggtgga 
accacgtggc 
ggccccggca 
tgcagggagt 
tggaccccat 
gaatcatctg 
tgaaggaact 
cggaggagta 
agagagccca 
agctcgccct 
tgaccaaggt 
gcatgctggc 
aggatgtggg 
tccacctgaa 
tggacttcca 



tcgctggcag 
ccgtggggga 
acggccgctg 
aagggacaca 
gcttcacatg 
tctgatctat 
agaggactgt 
gccctcagcc 
aagggtcact 
gagctggagg 
gattcctgaa 
tgatgctggg 
taccaagccc 
gccaaccgat 
cgtccgagtc 
ggggctggac 
gaagccgcgg 
ggcctggatc 
gcgggacttc 
cacctggggc 
cgagtacata 
tcacttcatc 
gaccaggctg 
gatccccatc 
cgccaccacc 
cattgcccag 
actgagcgtg 
gaccaaaaag 
caaggatggt 
gagccaagac 
gaatcgcgtg 
agatctgcac 
cttcctggaa 
gtacgagaac 
ggatcccgag 
ccaggtgctg 
caacgaccgg 
gagcctccgg 
ccgtgacctt 
ggctcagagc 
gtccaaggag 
tgagcacaga 
gaaactgctc 
ggaggccaag 
gtggtggggc 
cttcccgctg 
cacccccgcg 
catcctctcc 
gcctgtgccc 



ctggtgctgg 
gggagctctt 
ggacccccgc 
tctcacgcag 
aggaagcatg 
ttcaccaacc 
cctgagggca 
ctgaggaaag 
gacctgggga 
ctacagtgcc 
aacatcaaga 
aaggtggtgt 
cacaccttcc 
gcctttggcg 
tcccaggaca 
gtccccaatc 
ctgaagagca 
atcacagggg 
agcctgagca 
actgtccacc 
ctggatgagg 
ctcgtggacg 
gagaagttca 
gtgtgcgtgg 
aacggcaccc 
gtggccaacc 
ttcttccagg 
atccaagata 
cagcaggacg 
cactttggcc 
gacattgccc 
cccacgatga 
aacggggtgc 
ctggacccct 
cgcccggctt 
cgggagctgc 
ctgcggctcc 
tccctctaca 
ctcatttggg 
caggactgca 
gaggaggaca 
gccatcgggg 
acccgcgtgt 
gacatgaagt 
cagctctccg 
ctcctcaccg 
gcccgcgccc 
tacttcgcct 
tcctggtgcg 



cggtggagac 
tctctggccc 
atgtgcatgt 
ctgggaacat 
tggccaggtc 
accccatgtg 
gcaggcctgg 
ctggctcgga 
tggtctccaa 
ccttcggcaa 
agaaagaatg 
gtcagtgtgg 
agggcacaca 
acatcgtctt 
cgccctccag 
tcttgatctc 
ttttccgcag 
ggtcccacac 
gcagctacaa 
gccgcgaggg 
atggccaagg 
acgggaccca 
tatcggagca 
tgctggaggg 
cctgtgtggt 
tgcctgtctc 
agatgtttga 
ttgtccggag 
tggatgtggc 
acgagaactg 
gcagtgagat 
cagctgcact 
agctgaagga 
cctgcctgtt 
gcgcgcccgc 
tgggggactt 
tgctgcccgt 
agcgttcctc 
ccattgtcca 
tcgcagcggc 
cggacagctc 
tcttcaccga 
ccgaggcctg 
ttgtgtctca 
tggacaatgg 
gcctcatctc 
gtgccttctt 
tcctctgcct 
agtgtgccat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
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ctacctctgg ctcttctcct tggtgtgcga ggagatgcgg cagctcttct atgaccctga 3000 

cgagtgcggg ctgatgaaga aggcagcctt gcacttcagt gacttctgga ataagctgga 3060 

cgtcggcgca atcttgctct tcgtggcagg gctgacctgc aggctcatcc cggcgacgct 3120 

gtaccccggg cgcgtcatcc tctctctgga cttcatcctg ttctgcctcc ggctcatgca 3180 

catttttacc atcagtaaga cgctggggcc caagatcatc attgtgaagc ggatgatgaa 3240 

ggacgtcttc ttcttcctct tcctgctggc tgtgtgggtg gtgtccttcg gggtggccaa 3300 

gcaggccatc ctcatccaca acgagcgccg ggtggactgg ctgttccgag gggccgtcta 3360 

ccactcctac ctcaccatct tcgggcagat cccgggctac atcgacggtg tgaacttcaa 3420 

cccggagcac tgcagcccca atggcaccga cccctacaag cctaagtgcc ccgagagcga 3480 

cgcgacgcag cagaggccgg ccttccctga gtggctgacg gtcctcctac tctgcctcta 3540 

cctgctcttc accaacatcc tgctgctcaa cctcctcatc gccatgttca actacacctt 3600 

ccagcaggtg caggagcaca cggaccagat ttggaagttc cagcgccatg acctgatcga 3660 

ggagtaccac ggccgccccg ccgcgccgcc ccccttcatc ctcctcagcc acctgcagct 3720 

cttcatcaag agggtggtcc tgaagactcc ggccaagagg cacaagcagc tcaagaacaa 3780 

gctggagaag aacgaggagg cggccctgct atcctgggag atctacctga aggagaacta 3840 

cctccagaac cgacagttcc agcaaaagca gcggcccgag cagaagatcg aggacatcag 3900 

caataaggtt gacgccatgg tggacctgct ggacctggac ccactgaaga ggtcgggctc 3960 

catggagcag aggttggcct ccctggagga gcaggtggcc cagacagccc gagccctgca 4 020 

ctggatcgtg aggacgctgc gggccagcgg cttcagctcg gaggcggacg tccccactct 4 080 

ggcctcccag aaggccgcgg aggagccgga tgctgagccg ggaggcagga agaagacgga 4140 

ggagccgggc gacagctacc acgtgaatgc ccggcacctc ctctacccca actgccctgt 4 200 

cacgcgcttc cccgtgccca acgagaaggt gccctgggag acggagttcc tgatctatga 4 2 60 

cccacccttt tacacggcag agaggaagga cgcggccgcc atggacccca tgggagacac 4 320 

cctggagcca ctgtccacga tccagtacaa cgtggtggat ggcctgaggg accgccggag 4 380 

cttccacggg ccgtacacag tgcaggccgg gttgcccctg aaccccatgg gccgcacagg 4 4 40 

actgcgtggg cgcgggagcc tcagctgctt cggacccaac cacacgctgt accccatggt 4 500 

cacgcggtgg aggcggaacg aggatggagc catctgcagg aagagcataa agaagatgct 4 560 

ggaagtgctg gtggtgaagc tccctctctc cgagcactgg gccctgcctg ggggctcccg 4 620 

ggagccaggg gagatgctac ctcggaagct gaagcggatc ctccggcagg agcactggcc 4 680 

gtcttttgaa aacttgctga agtgcggcat ggaggtgtac aaaggctaca tggatgaccc 474 0 

gaggaacacg gacaatgcct ggatcgagac ggtggccgtc agcgtccact tccaggacca 4800 

gaatgacgtg gagctgaaca ggctgaactc taacctgcac gcctgcgact cgggggcctc 4 860 

catccgatgg caggtggtgg acaggcgcat cccactctat gcgaaccaca agaccctcct 4 920 

ccagaaggca gccgctgagt tcggggctca ctactgactg tgccctcagg ctgggcggct 4980 

ccagtccata gacgttcccc ccagaaacca gggcttctct ctcctgagcc tggccaggac 5040 

tcaggctgtt cctgggccct gcacatgatg gggtttggtg gacccagtgc ccctcacggc 5100 

tgccgcaagt ctgctgcaga tgacctcatg aactggaagg ggtcaaggtg acccgggagg 5160 

agagctcaag acagggcaca ggctactcag agctgagggg cccctgggac ccttggccat 5220 

caggcgaggg gctgggcctg tgcagctggg cccttggcca gagtccactc ccttcctggc 5280 

tgtgtcaccc cgagcagctc atccaccatg gaggtcattg gcctgaggca agttccccgg 5340 

agagtcggga tcccctgtgg ccccctcagg cctatgtctg tgaggaaggg gccctgccac 5400 

tctccccaag agggcctcca tgtttcgagg tgcctcaaca tggagccttg cctggcctgg 54 60 

gctaggggca ctgtctgaac tcctgactgt caggataaac tccgtggggg tacaggagcc 5520 

cagacaaagc ccaggcctgt caagagacgc agagggcccc tgccagggtt ggccccaggg 5580 

accctgggac gaggctgcag aagctctccc tccctactcc ctgggagcca cgtgctggcc 5640 

atgtggccag ggacggcatg agcaggaggc ggggacgtgg gggccttctg gtttggtgtc 5700 

aacagctcac aggagcgtga accatgaggg ccctcaggag gggaacgtgg taaaacccaa 57 60 

gacattaaat ctgccatctc aggcctggct ggctcttctg tgctttccac aaataaagtt 5820 

cctgacacgt ccagggccag gggctgtgtg acggctgcct gaagttctcc tcgatccccc 58 80 

ggtgagcttc ctgcagcctg tggatgtcct gcagcccctc agccctaccc ccaagtttct 594 0 

cctctgaccc atcagctccc tgtcttcatt ttcctaaacc tgggctccag catcgtcccc 6000 

aagcccacca ggccaggatg caggcatcca catgccctcc tccttggctt cccctgcgtg 6060 

gtggtgccaa tgtgccctgg cacccctgca gaggctccgg atggagcctg gggctgcctg 6120 

gccactgagc actggccgag gtgatgccca cccttccctg gacaggcctc tgtcttccac 618 0 

ctgacccaaa gctctctagc cacccccttg tccccagtat 6220 



<210> 12 
<211> 1503 
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<212> PRT 

<213> Homo Sapiens 



<400> 12 



Met 


Glu 


Pro 


Ser 


Ala 


Leu 


Arg 


Lys 


A J. a 


oxy 


Ser 


Glu 


Gin 


o J.U 






1 








5 




















15 




Phe 


Glu 


Gly 


Leu 


Pro 


Arg 


Arg 


Val 


Thr 


Asp 


Leu 


Gly 


Met 


Val 


Ser 


Asn 






20 










25 










30 






Leu 


Arg 


Arg 


Ser 


Asn 


Ser 


Ser 


Leu 


Phe 


Lys 


Ser 


Trp 


Arg 


Leu 


Gin 


Cys 




35 










40 










45 








Pro 


Phe 


Gly 


Asn 


Asn. Asp 


Lys 


Gin 


Glu 


Ser 


Leu 


Ser 


Ser 


Trp 


He 


Pro 




50 








55 










60 










Glu 


Asn 


He 


Lys 


Lys 


Lys 


Glu 


Cys 


Val 


Tyr 


Phe 


Val 


Glu 


Ser 


Ser 


Lys 


65 










70 










75 










o U 


Leu 


Ser 


Asp 


Ala 


Gly 


Lys 


Val 


Val 


Cys 


Gin 


Cys 


Gly Tyr 


Thr 


His 


Glu 










85 










90 










95 




Gin 


His 


Leu 


Glu 
100 


Glu 


Ala 


Thr 


Lys 


Pro 
105 


His 


Thr 


Phe 


Gin 


Gly 
110 


Thr 


Gin 


Trp 


Asp 


Pro 


Lys 


Lys 


His 


Val 


Gin 


Glu 


Met 


Pro 


Thr 


Asp 


Ala 


Phe 


Gly 




115 










120 










125 








Asp 


He 


Val 


Phe 


Thr 


Gly 


Leu 


Ser 


Gin 


Lys 


Val 


Lys 


Lys 


Tyr. Val 


Arg 


130 










135 










140 










Val 


Ser 


Gin 


Asp 


Thr 


Pro 


Ser 


Ser 


Val 


He 


Tyr 


His 


Leu 


Met 


Thr 


Gin 


145 








150 










155 










1 bU 


His 


Trp 


Gly 


Leu 


Asp 


Val 


Pro 


Asn 


Leu 


Leu 


He 


Ser 


Val 


Thr 


Gly 


Gly 






165 










170 










175 




Ala 


Lys 


Asn 


Phe 


Asn 


Met 


Lys 


Pro 


Arg 


Leu 


Lys 


Ser 


He 


Pne 


Arg 


Arg 






180 










185 










190 






Gly 


Leu 


Val 


Lys 


Val 


Ala 


Gin 


Thr 


Thr 


Gly 


Ala 


Trp 


He 


He 


Thr 


r*! XT 
K3±y 




195 








200 










205 








Gly 


Ser 


His 


Thr 


Gly 


Val 


Met 


Lys 


Gin 


Val 


Gly 


Glu 


Ala 


Val 


Arg 


Asp 


210 










215 










220 










Phe 


Ser 


Leu 


Ser 


Ser 


Ser 


Tyr 


Lys 


Glu 


Gly 


Glu 


Leu 


He 


Thr 


He 


o±y 


225 










230 










235 










240 


Val 


Ala 


Thr 


Trp 


Gly 
245 


Thr 


Val 


His 


Arg 


Arg 

OCA 


Glu 


Gly 


Leu 


He 


His 
255 


Pro 


Thr 


Gly 


Ser 


Phe 


Pro 


Ala 


Glu 


Tyr 


He 


Leu 


Asp 


Glu 


Asp 


Gly 


Gin 








260 










265 










270 






Asn 


Leu 


Thr 


Cys 


Leu 


Asp 


Ser 


Asn 


His 


Ser 


His 


Phe 


He 


Leu 


Val 


Asp 






275 








280 










285 








Asp 


Gly 


Thr 


His 


Gly 


Gin 


Tyr 


Gly 


Val 


Glu 


He 


Pro 


Leu 


Arg 


Thr 


Arg 


290 










295 










300 










Leu 


Glu 


Lys 


Phe 


He 


Ser 


Glu 


Gin 


Thr 


Lys 


GiU 


Arg 


Gly 


Gly 


Val 


a 1 a 


r\ c 

JUb 








310 










315 










320 


lie 


Lys 


He 


Pro 


He 


Val 


Cys 


Val 


Val 


Leu 


Glu 


Gly 


Gly 


Pro 


Gly 


Thr 








325 










330 










335 




Leu 


His 


Thr 


He 


Asp 


Asn 


Ala 


Thr 


Thr 


Asn 


Gly 


Thr 


Pro 


Cys 


Val 


Val 








340 








345 










350 






Val 


Glu 


Gly 
355 


Ser 


Gly 


Arg 


Val 


Ala 
360 


Asp 


Val 


He 


Ala 


Gin 
365 


Val 


Ala 


Asn 


Leu 


Pro 


Val 


Ser 


Asp 


He 


Thr 


He 


Ser 


Leu 


He 


Gin 


Gin 


Lys 


Leu 


Ser 




370 








375 










380 










Val 


Phe 


Phe 


Gin 


Glu 


Met 


Phe 


Glu 


Thr 


Phe 


Thr 


Glu 


Ser 


Arg 


He 


Val 


385 










390 










395 










400 


Glu 


Trp 


Thr 


Lys 


Lys 


He 


Gin 


Asp 


He 


Val 


Arg 


Arg 


Arg 


Gin 


Leu 


Leu 






405 










410 










415 




Thr 


Val 


Phe 


Arg 
420 


Glu 


Gly 


Lys 


Asp 


Gly 
425 


Gin 


Gin 


Asp 


Val 


Asp 
430 


Val 
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Arg Met Met Lys Asp Val Phe Phe Phe Leu Phe Leu Leu Ala Val Trp 

930 935 940 

Val Val Ser Phe Gly Val Ala Lys Gin Ala lie Leu lie His Asn Glu 
945 950 955 960 

Arg Arg Val Asp Trp Leu Phe Arg Gly Ala Val Tyr His Ser Tyr Leu 

965 970 975 

Thr lie Phe Gly Gin lie Pro Gly Tyr lie Asp Gly Val Asn Phe Asn 

980 985 990 

Pro Glu His Cys Ser Pro Asn Gly Thr Asp Pro Tyr Lys Pro Lys Cys 

995 1000 1005 

Pro Glu Ser Asp Ala Thr Gin Gin Arg Pro Ala Phe Pro Glu Trp Leu 

1010 1015 1020 

Thr Val Leu Leu Leu Cys Leu Tyr Leu Leu Phe Thr Asn lie Leu Leu 
1025 1030 1035 104 

Leu Asn Leu Leu lie Ala Met Phe Asn Tyr Thr Phe Gin Gin Val Gin 

1045 1050 1055 

Glu His Thr Asp Gin lie Trp Lys Phe Gin Arg His Asp Leu He Glu 

1060 1065 1070 

Glu Tyr His Gly Arg Pro Ala Ala Pro Pro Pro Phe He Leu Leu Ser 

1075 1080 1085 

His Leu Gin Leu Phe He Lys Arg Val Val Leu Lys Thr Pro Ala Lys 

1090 1095 1100 

Arg His Lys Gin Leu Lys Asn Lys Leu Glu Lys Asn Glu Glu Ala Ala 
1105 1110 1115 112 

Leu Leu Ser Trp Glu He Tyr Leu Lys Glu Asn Tyr Leu Gin Asn Arg 

1125 1130 1135 

Gin Phe Gin Gin Lys Gin Arg Pro Glu Gin Lys He Glu Asp He Ser 

1140 1145 1150 

Asn Lys Val Asp Ala Met Val Asp Leu Leu Asp Leu Asp Pro Leu Lys 

1155 1160 1165 

Arg Ser Gly Ser Met Glu Gin Arg Leu Ala Ser Leu Glu Glu Gin Val 

1170 1175 1180 

Ala Gin Thr Ala Arg Ala Leu His Trp He Val Arg Thr Leu Arg Ala 
1185 1190 1195 120 

Ser Gly Phe Ser Ser Glu Ala Asp Val Pro Thr Leu Ala Ser Gin Lys 

1205 1210 1215 

Ala Ala Glu Glu Pro Asp Ala Glu Pro Gly Gly Arg Lys Lys Thr Glu 

1220 1225 1230 

Glu Pro Gly Asp Ser Tyr His Val Asn Ala Arg His Leu Leu Tyr Pro 

1235 1240 . 1245 

Asn Cys Pro Val Thr Arg Phe Pro Val Pro Asn Glu Lys Val Pro Trp 

1250 1255 1260 

Glu Thr Glu Phe Leu He Tyr Asp Pro Pro Phe Tyr Thr Ala Glu Arg 
1265 1270 1275 128 

Lys Asp Ala Ala Ala Met Asp Pro Met Gly Asp Thr Leu Glu Pro Leu 

1285 1290 1295 

Ser Thr He Gin Tyr Asn Val Val Asp Gly Leu Arg Asp Arg Arg Ser 

1300 1305 1310 

Phe His Gly Pro Tyr Thr Val Gin Ala Gly Leu Pro Leu Asn Pro Met 

1315 1320 1325 

Gly Arg Thr Gly Leu Arg Gly Arg Gly Ser Leu Ser Cys Phe Gly Pro 

1330 1335 1340 

Asn His Thr Leu Tyr Pro Met Val Thr Arg Trp Arg Arg Asn Glu Asp 
1345 1350 1355 136 

Gly Ala He Cys Arg Lys Ser He Lys Lys Met Leu Glu Val Leu Val 

1365 1370 1375 

Val Lys Leu Pro Leu Ser Glu His Trp Ala Leu Pro Gly Gly Ser Arg 

1380 1385 1390 

Glu Pro Gly Glu Met Leu Pro Arg Lys Leu Lys Arg He Leu Arg Gin 
1395 1400 1405 
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Glu His Trp Pro Ser Phe Glu Asn Leu Leu Lys Cys Gly Met Glu Val 

1410 ' 1415 1420 

Tyr Lys Gly Tyr Met Asp Asp Pro Arg Asn Thr Asp Asn Ala Trp lie 
1425 1430 1435 144 

Glu Thr Val Ala Val Ser Val His Phe Gin Asp Gin Asn Asp Val Glu 

1445 1450 1455 

Leu Asn Arg Leu Asn Ser Asn Leu His Ala Cys Asp Ser Gly Ala Ser 

1460 1465 1470 

lie Arg Trp Gin Val Val Asp Arg Arg He Pro Leu Tyr Ala Asn His 

1475 1480 1485 

Lys Thr Leu Leu Gin Lys Ala Ala Ala Glu Phe Gly Ala His Tyr 
1490 1495 1500 
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Leu 


Leu 


Gly 


Phe 


Lys 


Thr 


Lys 


Glu 


Gin 


Leu 


Met 


Leu 


Gin 


Pro 


Lys 


Thr 








980 










985 










990 






Ala 


Ala 


Glu 


His 


Asp 


Glu 


Glu 


Met 


Ser 


Asp 


Ser 


Glu 


Met 


Asn 


Ser 


Ala 






995 










1000 








1005 






Glu 


Asp 


Thr 


Asp 


Thr 


Ser 


Ser 


Asp 


Ser 


Ser 


Ser 


Asp 


Ser 


Asp 


Asp 


Ser 




1010 








1015 








1020 








Asp 


Glu 


Glu 


Asp 


Ala 


Lys 


Leu 


Arg 


Ala 


Gin 


Ser 


Leu 


Ser 


Ala 


Asp 


Gin 


1025 








1030 








1035 








104 


Pro 


Leu 


Ser 


He 


His 


Arg 


Leu 


Val 


Arg 


Asp 


Lys 


Leu 


Asn 


Phe 


Ser 


Glu 










1045 








1050 








1055 


Lys 


Lys 


Lys 


Pro 


Asp Met 


Gly 


He 


Ser 


Arg 


He 


Val 


Val 


Ala 


Pro 


Pro 








1060 








1065 








1070 




He 


Val 


Thr 


Gly Arg Asn Arg 


Ala 


Arg 


Thr 


Met 


Ser 


He 


Lys 


Lys 


Ser 



1075 1080 1085 

Lys Lys Asn Val He Lys Pro Pro Ala Cys Leu Lys He Glu Thr Ser 

1090 1095 1100 

Asp Asp Asp Glu Gin Glu Gin Lys Lys Ala Thr Glu Met Cys Lys Ser 

1105 1110 1115 112 
Thr Phe Phe Asp Phe Phe Phe Asp Phe Pro Tyr He Asn Arg Thr Gly 

1125 1130 1135 
Lys Arg Gly Ser Val Ala Val Ala Met Asn His Asp Asp Met Tyr He 

1140 1145 1150 

Asp Pro Ser Glu Glu Leu Asp Thr Gin Thr Arg Gin Lys Ser Ser Arg 

1155 1160 1165 

Glu Phe Ser Ser Ser Arg Asn Val Thr Val Gin Val Tyr Thr Gin Arg 

1170 1175 1180 

Pro Leu Ser Trp Lys Lys Lys He Met Glu Phe Tyr Lys Ala Pro He 

1185 1190 1195 120 

Thr Thr Tyr Trp Leu Trp Phe Phe Ala Phe He Trp Phe Leu He Leu 

1205 1210 1215 

Leu Thr Tyr Asn Leu Leu Val Lys Thr Gin Arg He Ala Ser Trp Ser 

1220 1225 1230 

Glu Trp Tyr Val Phe Ala Tyr He Phe Val Trp Thr Leu Glu He Gly 

1235 1240 1245 

Arg Lys Val Val Ser Thr He I-3et Met Asp Thr Ser Lys Pro Val Leu 

1250 1255 1260 

Lys Gin Leu Arg Val Phe Phe Phe Gin Tyr Arg Asn Gly Leu Leu Ala 

1265 1270 1275 128 

Phe Gly Leu Leu Thr Tyr Leu He Ala Tyr Phe He Arg Leu Ser Pro 
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1285 1290 1295 

Thr Thr Lys Thr Leu Gly Arg He Leu He He Cys Asn Ser Val He 

1300 1305 1310 

Trp Ser Leu Lys Leu Val Asp Tyr Leu Ser Val Gin Gin Gly Leu Gly 

1315 1320 1325 

Pro Tyr He Asn He Val Ala Glu Met He Pro Thr Met He Pro Leu 

1330 1335 1340 

Cys Val Leu Val Phe He Thr Leu Tyr Ala Phe Gly Leu Leu Arg Gin 
1345 1350 1355 136 

Ser He Thr Tyr Pro Tyr Glu Asp Trp His Trp He Leu Val Arg Asn 

1365 1370 1375 

He Phe Leu Gin Pro Tyr Phe Met Leu Tyr Gly Glu Val Tyr Ala Ala 

1380 1385 1390 

Glu He Asp Thr Cys Gly Asp Glu He Trp Gin Thr His Glu Asp Glu 

1395 1400 1405 

Asn He Pro He Ser Met Leu Asn Val Thr His Glu Thr Cys Val Pro 

1410 1415 1420 

Gly Tyr Trp He Ala Pro Val Gly Leu Thr Val Phe Met Leu Ala Thr 
1425 1430 1435 144 

Asn Val Leu Leu Met Asn Val Met Val Ala Gly Cys Thr Tyr He Phe 

1445 1450 1455 

Glu Lys His He Gin Ser Thr Arg Glu He Phe Leu Phe Glu Arg Tyr 

1460 1465 1470 

Gly Gin Val Met Glu Tyr Glu Ser Thr Pro Trp Leu Pro Pro Pro Phe 

1475 1480 1485 

Thr He He Tyr His Val He Trp Leu Phe Lys Leu He Lys Ser Ser 

1490 1495 1500 

Ser Arg Met Phe Glu Arg Lys Asn Leu Phe Asp Gin Ser Leu Lys Leu 
1505 1510 1515 152 

Phe Leu Ser Pro Asp Glu Met Glu Lys Val His Thr Phe Glu Glu Glu 

1525 1530 1535 

Ser Val Glu Asp Met Lys Arg Glu Thr Glu Lys Lys Asn Leu Ser Ser 

1540 1545 1550 

Asn Asp Glu Arg He His Arg Thr Ala Glu Arg Thr Asp Ala He Leu 

1555 1560 1565 

Asn Arg Val Ser His Leu Thr Gin Leu Glu Phe Thr Leu Lys Glu Glu 

1570 1575 1580 

He Arg Glu Leu Glu His Lys Met Lys Asn Met Asp Ser Arg His Lys 
1585 1590 1595 160 

Glu Gin Met Asn Leu Met Leu Asp Met Asn Lys Lys Leu Gly Lys Phe 

1605 1610 1615 

He Ser Gly Lys Tyr Lys Arg Gly Ser Phe Gly Gly Ser Gly Ser Asp 

1620 1625 1630 

Gly Gly Gly Gly Ser Ser Asp Asn Ser Lys Leu Glu Pro Asn Asn Ser 

1635 1640 1645 

Val Pro Met He Thr Val Asp Gly Pro Ser Pro He Gly Ser Arg Arg 

1650 1655 1660 

Thr Ser Gly Gin Tyr Leu Lys Arg Asp Ser Leu Gin Ala Lys Lys Lys 
1665 1670 1675 168 

He Thr Glu Asn Arg Arg Ser Ser Leu Glu Gin Pro Lys He Pro Ser 

1685 1690 1695 

He Gin Phe Asn Leu Met Glu Asp Gin Asp Glu Ser Ala Ala Glu Ser 

1700 1705 1710 

Ala Thr Glu Glu Val Ser He Ser He Pro Val Pro Gin Met Arg Val 

1715 1720 1725 

Arg Gin Val Thr Glu Ser Asp Lys Ser Asp Leu Ser Glu Asp Asp Leu 

1730 1735 1740 

He Thr Arg Glu Asp Ala Pro Pro Thr Ser He Asn Leu Pro Arg Gly 
1745 1750 1755 176 

Pro Arg Arg His Ala Leu Tyr Ser Thr He Ala Asp Ala He Glu Thr 
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Glu Asp Asp Phe Tyr Ala 
1780 

Val Gin Pro Ala Asp Gly 
1795 

Gin Arg Asp Asp Ser Asp 
1810 
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1770 

Asp Ser Pro Val Pro Met 
1785 

Ser Phe Phe Gly Glu Asn 

1800 
Tyr Glu 
1815 
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1775 
Pro Met Thr Pro 

1790 
Asp Ser Arg Tyr 
1805 



<210> 14 
<211> 1387 
<212> PRT 
<213> C. Elegans 





<400> 


14 


Met 


Arg 


Lys 


Ser 


1 








lie 


Glu 


Asn 


He 








20 


Ala 


Pro 


Arg 


Asn 






35 




Ser 


Phe 


Arg 


Ser 




50 






Asp 


Asn 


Pro 


Asn 


65 








Pro 


Trp 


He 


Glu 


Ala 


Leu 


Pro 


Lys 








100 


Ala 


His 


Thr 


Pro 






115 




Leu 


Glu 


Lys 


Glu 




130 






Ser 


Thr 


Thr 


Asp 


145 








Ala 


His 


Lys 


Ala 


Asp 


Val 


Met 


Tyr 








180 


Leu 


Val 


He 


Thr 






195 




Arg 


Leu 


Gly 


Arg 




210 






Thr 


Gly 


Ala 


Trp 


225 








His 


Val 


Ala 


Lys 


Ser 


Gin 


He 


Val 








260 


Lys 


Glu 


Arg 


Leu 






275 




Ser 


Leu 


Ser 


Val 




290 






Tyr 


Phe 


Leu 


Leu 


305 








Leu 


His 


Leu 


Arg 


Asn 


Gly 


Arg 


Lys 








340 


Ser 


Ser 


He 


Asn 






355 





Arg 


Arg 


Val 


Arg 


5 








Arg 


His 


Arg 


Thr 


Ser 


Met 


Cys 


Asn 








4 0 


Asp 


His 


Leu 


Ser 






55 




Leu 


Phe 


Ala 


He 




70 






Asn 


Thr 


Phe 


Glu 


85 








Asp 


Pro 


Glu 


Arg 


Ala 


Ser 


Thr 


Pne 








120 


Gin 


Gin 


Thr 


Trp 






135 




Ala 


Phe 


Gly 


Thr 










Gin 


Tyr 


Val 


Arg 


165 








Leu 


Met 


Glu 


Lys 


Val 


His 


Gly 


Gly 








200 


Leu 


Phe 


Arg 


Lys 






215 




He 


He 


Thr 


Ser 




230 






Ala 


Leu 


Asp 


Glu 


245 








Thr 


He 


Gly 


He 


He 


Arg 


Gin 


Asn 








280 


Asn 


Ala 


Asn 


Val 






295 




Ala 


Asp 


Asn 


Gly 




310 






Gin 


Asn 


Leu 


Glu 


325 








Val 


Pro 


Val 


Val 


Ala 


He 


His 


Asp 
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Lys Arg Glu Cys 
90 

Cys Gly Cys Gly 
105 

Phe Ser Thr Leu 

Thr He Ala Asn 
140 

He Val Phe Gin 
155 

Leu Ser Tyr Asp 
170 
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He 


He 
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He 
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He 
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545 
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560 


Leu 


Thr 


He 
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He 
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Lys 
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Ser 
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Lys 
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He 
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Lys 


Lys 
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Glu 
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Ala 


Tyr 


Pro 
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He 


Trp 
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Met 
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He 
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He 
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Asp 
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Cys 
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700 










Ala 


He 


Glu 


Leu 


Leu 


Asp 


Gin 
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Lys 


Val 






755 










760 










765 








Arg 


Ser 


Gly 


Ser 


Asn 


Val 
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Glu 


Thr 


Pro 


Glu 
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Leu 
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Arg 


Glu 


Ser 
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He 
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Ser 


Thr 


Thr 


Ser 
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Arg 
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Arg 
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Ser 


Lys 


Gly 
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Glu 








820 
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Glu 
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Gly 


Ser 
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Thr 


Lys 


Lys 


Val 






835 










840 










845 
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Thr He He Ser Ser Arg Lys Asn Ser Gly Vai Ala Ser Val Tyr Gly 

850 855 860 

Ser Ala Ser Ser Met Met Phe Lys Arg Glu Pro Gin Leu Asn Lys Phe 
865 870 875 880 

Glu Arg Phe Arg Ala Phe Tyr Ser Ser Pro He Thr Lys Phe Trp Ser 

885 890 895 

Trp Cys He Ala Phe Leu He Phe Leu Thr Thr Gin Thr Cys He Leu 

900 905 910 

Leu Leu Glu Thr Ser Leu Lys Pro Ser Lys Tyr Glu Trp He Thr Phe 

915 920 925 

He Tyr Thr Val Thr Leu Ser Val Glu His He Arg Lys Leu Met Thr 

930 935 940 

Ser Glu Gly Ser Arg He Asn Glu Lys Val Lys Val Phe Tyr Ala Lys 
945 950 955 960 

Trp Tyr Asn He Trp Thr Ser Ala Ala Leu Leu Phe Phe Leu Val Gly 

965 970 975 

Tyr Gly Phe Arg Leu Val Pro Met Tyr Arg His Ser Trp Gly Arg Val 

980 985 990 

Leu Leu Ser Phe Ser Asn Val Leu Phe Tyr Met Lys He Phe Glu Tyr 

995 1000 1005 

Leu Ser Val His Pro Leu Leu Gly Pro Tyr He Gin Met Ala Ala Lys 

1010 1015 1020 

Met Val Trp Ser Met Cys Tyr He Cys Val Leu Leu Leu Val Pro Leu 
1025 1030 1035 104 

Met Ala Phe Gly Val Asn Arg Gin Ala Leu Thr Glu Pro Asn Val Lys 

1045 1050 1055 

Asp Trp His Trp Leu Leu Val Arg Asn He Phe Tyr Lys Pro Tyr Phe 

1060 1065 1070 

Met Leu Tyr Gly Glu Val Tyr Ala Gly Glu He Asp Thr Cys Gly Asp 

1075 1080 1085 

Glu Gly He Arg Cys Phe Pro Gly Tyr Phe He Pro Pro Leu Leu Met 

1090 1095 1100 

Val He Phe Leu Leu Val Ala Asn He Leu Leu Leu Asn Leu Leu He 
1105 1110 1115 112 

Ala He Phe Asn Asn He Tyr Asn Asp Ser He Glu Lys Ser Lys Glu 

1125 1130 1135 

He Trp Leu Phe Gin Arg Tyr Gin Gin Leu Met Glu Tyr His Asp Ser 

1140 1145 1150 

Pro Phe Leu Pro Pro Pro Phe Ser He Phe Ala His Val Tyr His Phe 

1155 1160 1165 

He Asp Tyr Leu Tyr Asn Leu Arg Arg Pro Asp Thr Lys Arg Phe Arg 

1170 1175 1180 

Ser Glu His Ser He Lys Leu Ser Val Thr Glu Asp Glu Met Lys Arg 
1185 1190 1195 120 

He Gin Asp Phe Glu Glu Asp Cys He Asp Thr Leu Thr Arg He Arg 

1205 1210 1215 

Lys Leu Lys Leu Asn Thr Lys Glu Pro Leu Ser Val Thr Asp Leu Thr 

1220 1225 1230 

Glu Leu Thr Cys Gin Arg Val His Asp Leu Met Gin Glu Asn Phe Leu 

1235 1240 1245 

Leu Lys Ser Arg Val Tyr Asp He Glu Thr Lys He Asp His He Ser 

1250 1255 1260 

Asn Ser Ser Asp Glu Val Val Gin He Leu Lys Asn Lys Lys Leu Ser 
1265 1270 1275 128 

Gin Asn Phe Ala Ala Ser Ser Leu Ser Leu Pro Asp Thr Ser He Glu 

1285 1290 1295 

Val Pro Lys He Thr Lys Thr Leu He Asp Cys His Leu Ser Pro Val 

1300 1305 1310 

Ser He Glu Asp Arg Leu Ala Thr Arg Ser Pro Leu Leu Ala Asn Leu 
1315 1320 1325 
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Gin Arg Asp His Thr Leu Arg Lys 

1330 1335 
Ala Ser Thr Ser Ser Phe Glu Phe 
1345 1350 
Gly Asn Glu Asn Lys Tyr Glu Phe 
1365 

Trp Arg Asn Asn Tyr Val lie Ser 
1380 
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Leu Pro Thr Trp Glu Thr Ser Thr 
1340 

Val Phe Tyr Phe Thr Arg His Glu 
1355 136 
Lys Lys Leu Glu Lys Gly Gly Phe 

1370 1375 
Trp Arg Leu 
1385 
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205 








Glu 


Phe 


Gin 


Gly 


Gly 


Pro 


His 


Pro 








220 










Asn 


Phe 


Asp 


Thr 


Glu 


Pro 


Ala 


Tyr 






235 










240 


Trp 


Gin 


He 


Ser 


Pro 


Pro 


Arg 


Leu 




250 










255 




Ser 


Asn 


Phe 


Asp 


Leu 


Gin 


Pro 


Lys 


265 










270 






Leu 


Leu 


Lys 


Ala 


Ala 


Ser 


Thr 


Thr 










285 








Cys 


Asp 


Thr 


Gly 


Val 


Val 


Lys 


His 








300 










Gin 


Ser 


Ala 


Gin 


Arg 


Asn 


Lys 


He 






315 










320 


Gly 


Leu 


Leu 


Lys 


Lys 


Arg 


Glu 


Asp 




330 










335 




Pro 


Tyr 


Tyr 


Pro 


Ser 


Ser 


Ser 


Lys 


345 










350 






Arg 


His 


Ser 


Tyr 


Phe 


Leu 


Leu 


Val 



wo 00/4061 4 



355 



Asp 


Asn 


Gly 


Thr 


Val 


Gly 


Arg 




370 










37 5 


Arg 


Leu 


Glu 


Met 


Tyr 


He 


Ser 


385 










390 




Arg 


Ser 


Val 


Pro 


Val 


Val 


Cys 








405 






lie 


Arg 


Ser 


Val 


Leu 


Asp 


Tyr 








420 








Val 


Val 


Cys 


Asp 


Gly 


Ser 


Gly 






435 










His 


Gin 


Asn 


Val 


Thr 


Glu 


Asp 




450 










455 


Gin 


Val 


Leu 


Leu 


Leu 


Val 


Glu 


465 










470 




Ala 


His 


Arg 


Leu 


Leu 


His 


Glu 










485 






Leu 


Leu 


Thr 


lie 


Phe 


Arg 


Leu 








500 








His 


Ala 


lie 


Leu 


Thr 


Ala 


Leu 






515 










Asp 


Gin 


Leu 


Ala 


Leu 


Ala 


Leu 


530 










535 


Ser 


Asp 


Val 


Phe 


Ala 


Met 


Gly 


545 










550 




Asn 


Ala 


Met 


Met 


Glu 


Ala 


Leu 










565 






Leu 


Leu 


Leu 


Glu 


Gin 


Gly 


He 








580 








Arg 


Leu 


Asp 


Glu 


Leu 


Tyr 


Asn 






595 










Phe 


Tyr 


lie 


Val 


Arg 


Asp 


Val 




610 










615 


Lys 


Leu 


Pro 


Asp 


He 


Gly 


Leu 


625 










630 




Tyr 


Gin 


Cys 


Ser 


Tyr 


Thr 


Thr 










645 






Arg 


Met 


Lys 


Arg 


Val 


Lys 


His 








660 








Ser 


Ser 


Arg 


Pro 


Ser 


Arg 


Thr 






675 










Thr 


Glu 


Gly 


Met 


Gly 


Gly 


Val 




690 










695 


Phe 


Gly 


Asn 


Ser 


Phe 


Gly 


Asn 


705 










710 




Asn 


Arg 


Ser 


Ala 


Leu 


Ser 


Gly 










725 






Trp 


Arg 


Ser 


Ala 


Phe 


Arg 


Gly 








740 








Pro 


Asn 


Leu 


Gly 


Asp 


Ser 


Arg 






755 










Leu 


Ser 


Leu 


Thr 


Ser 


Ala 


Ser 




770 










775 


Arg 


Tyr 


Pro 


Tyr 


Ser 


Glu 


Leu 


785 










790 




Gin 


Asp 


Met 


Ala 


Met 


Cys 


Met 










805 






Lys 


Ala 


Leu 


Val 


Ala 


Cys 


Arg 








820 








Ala 


Glu 


Asp 


Tyr 


Leu 


Glu 


Val 



PCTAJS99/29996 
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360 365 



Tyr 


Gly 


Ala 


Glu 


Val 


He 


Leu 


Arg 


Lys 










380 










Gin 


Lys 


Gin 


Lys 


He 


Phe 


Gly 


Gly 


Thr 








395 










400 


Val 


Val 


Leu 


Glu 


Gly Gly 


Ser 


Cys 


Thr 






410 










415 




Val 


Thr 


Asn 


Val 


Pro Arg 


Val 


Pro 


Val 




425 










4 30 






Arg 


Ala 


Ala 


Asp 


Leu 


Leu 


Ala 


Phe 


Ala 


440 










445 








Gly 


Leu 


Leu 


Pro 


Asp 


Asp 


He 


Arg 


Arg 










460 










Thr 


Thr 


Phe 


Gly Cys 


Ser 


Glu 


Ala 


Ala 








475 










480 


Leu 


Thr 


Val 


Cys 


Ala 


Gin 


His 


Lys 


Asn 






4 90 










495 




Gly 


Glu 


Gin 


Gly Glu 


His 


Asp 


Val 


Asp 




505 










510 






Leu 


Lys 


Gly 


Gin 


Asn 


Leu 


Ser 


Ala 


Ala 


520 










525 








Ala 


Trp 


Asn 


Arg 


Val 


Asp 


He 


Ala 


Arg 










540 










His 


Glu 


Trp 


Pro 


Gin 


Ala 


Ala 


Leu 


His 








555 










560 


He 


His 


Asp 


Arg 


Val 


Asp 


Phe 


Val 


Arg 






570 










575 




Asn 


Met 


Gin 


Lys 


Phe 


Leu 


Thr 


He 


Ser 




585 










590 






Thr 


Asp 


Lys 


Gly 


Pro 


Pro 


Asn 


Thr 


Leu 


600 










605 








Val 


Arg 


Val 


Arg 


Gin 


Gly 


Tyr 


Arg 


Phe 










620 










Val 


He 


Glu 


Lys 


Leu 


Met 


Gly 


Asn 


Ser 








635 










640 


Ser 


Glu 


Phe 


Arg 


Asp 


Lys 


Tyr 


Lys 


Gin 






650 










655 




Ala 


Gin 


Lys 


Lys 


Ala 


Met 


Gly 


Val 


Phe 




665 










670 






Gly 


Ser 


Gly 


He 


Ala 


Ser 


Arg 


Gin 


Ser 


680 










685 








Gly 


Gly 


Gly 


Ser 


Ser 


Val 


Ala 


Gly 


Val 










700 










Gin 


Asp 


Pro 


Pro 


Leu 


Asp 


Pro 


His 


Val 








715 










720 


Ser 


Arg 


Ala 


Leu 


Ser 


Asn 


His 


He 


Leu 






730 










735 




Asn 


Phe 


Pro 


Ala 


Asn 


Pro 


Met 


Arg 


Pro 




745 










750 






Asp 


Cys 


Gly 


Ser 


Glu 


Phe 


Asp 


Glu 


Glu 


760 










765 








Asp 


Gly 


Ser 


Gin 


Thr 


Glu 


Pro 


Asp 


Phe 










780 










Met 


He 


Trp 


Ala 


Val 


Leu 


Thr 


Lys 


Arg 








795 










800 


Trp 


Gin 


His 


Gly 


Glu 


Glu 


Ala 


Met 


Ala 






810 










815 




Leu 


Tyr 


Lys 


Ser 


Leu 


Ala 


Thr 


Glu 


Ala 




825 










830 






Glu 


He 


Cys 


Glu 


Glu 


Leu 


Lys 


Lys 


Tyr 
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835 










84 G 










845 








Ala 


Glu 


Glu 


Phe 


Arg 


lie 


Leu 


Ser 


Leu 


Glu 


Leu 


Leu 


Asp 


His 


Cys 


Tyr 




850 










855 










860 










Kis 


Val 


Asp 


Asp 


Ala 


Gin 


Thr 


Leu 


Gin 


Leu 


Leu 


Thr 


Tyr 


Glu 


Leu 


Ser 


8 65 










870 










875 










880 


Asn 


Trp 


Ser 


Asn 


Glu 


Thr 


Cys 


Leu 


Ala 


Leu 


Ala 


Val 


lie 


Val 


Asn 


Asn 








885 










890 










895 




Lys 


His 


Phe 


Leu 


Ala 


His 


Pro 


Cys 


Cys 


Gin 


He 


Leu 


Leu 


Ala 


Asp 


Leu 








900 










905 










910 






Trp 


His 


Gly 


Gly 


Leu 


Arg 


Met 


Arg 


Thr 


His 


Ser 


Asn 


He 


Lys 


Val 


Val 






915 










920 










925 








Leu 


Gly 


Leu 


lie 


Cys 


Pro 


Pro 


Phe 


He 


Gin 


Met 


Leu 


Glu 


Phe 


Lys 


Thr 




930 










935 










940 










Arg 


Glu 


Glu 


Leu 


Leu 


Asn 


Gin 


Pro 


Gin 


Thr 


Ala 


Ala 


Glu 


His 


Gin 


Asn 


945 










950 










955 










960 


Asp 


Met 


Asn 


Tyr 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 








965 










970 










975 




Ser 


Ser 


Ser 


Ser 


Ser 


Asp 


Ser 


Ser 


Ser 


Phe 


Glu 


Asp 


Asp 


Asp 


Asp 


Glu 








980 










985 










990 






Asn 


Asn 


Ala 


His 


Asn 


His 


Asp 


Gin 


Lys 


Arg 


Thr 


Arg 


Lys 


Thr 


Ser 


Gin 






995 










1000 








1005 






Gly 


Ser 


Ala 


Gin 


Ser 


Leu 


Asn 


He 


Thr 


Ser 


Leu 


Phe 


His 


Ser 


Arg 


Arg 




1010 








1015 








1020 








Arg 


Lys 


Ala 


Lys 


Lys 


Asn 


Glu 


Lys 


Cys 


Asp 


Arg 


Glu 


Thr 


Asp 


Ala 


Ser 


1025 








1030 








1035 








104 


Ala 


Cys 


Glu 


Ala 


Gly 


Asn 


Arg 


Gin 


He 


Gin 


Asn 


Gly Gly 


Leu 


Thr 


Ala 










1045 








1050 








1055 


Glu 


Tyr 


Gly 


Thr 


Phe 


Gly 


Glu 


Ser 


Asn 


Gly Val 


Ser 


Pro 


Pro 


Pro 


Pro 








1060 








1065 








1070 




Tyr 


Met 


Arg 


Ala 


Asn 


Ser 


Arg 


Ser 


Arg 


Tyr 


Asn 


Asn 


Arg 


Ser 


Asp 


Met 






1075 








1080 








1085 






Ser 


Lys 


Thr 


Ser 


Ser 


Val 


He 


Phe 


Gly 


Ser 


Asp 


Pro 


Asn 


Leu 


Ser 


Lys 




1090 








1095 








1100 








Leu 


Gin 


Lys 


Ser 


Asn 


He 


Thr 


Ser 


Thr 


Asp Arg 


Pro 


Asn 


Pro 


Met 


Glu 


1105 








1110 








1115 








112 


Gin 


Phe 


Gin 


Gly 


Thr 


Arg 


Lys 


He 


Lys 


Met 


Arg 


Arg 


Arg 


Phe 


Tyr 


Glu 



1125 1130 1135 



Phe Tyr Ser Ala Pro He Ser Thr Phe Trp Ser Trp Thr He Ser Phe 

1140 1145 1150 

He Leu Phe He Thr Phe Phe Thr Tyr Thr Leu Leu Val Lys Thr Pro 

1155 1160 1165 

Pro Arg Pro Thr Val He Glu Tyr He Leu He Ala Tyr Val Ala 'Ala 

1170 1175 1180 

Phe Gly Leu Glu Gin Val Arg Lys He He Met Ser Asp Ala Lys Pro 
1185 1190 1195 120 

Phe Tyr Glu Lys He Arg Thr Tyr Val Cys Ser Phe Trp Asn Cys Val 

1205 1210 1215 

Thr He Leu Ala He He Phe Tyr He Val Gly Phe Phe Met Arg Cys 

1220 1225 1230 

Phe Gly Ser Val Ala Tyr Gly Arg Val He Leu Ala Cys Asp Ser Val 

1235 1240 1245 

Leu Trp Thr Met Lys Leu Leu Asp Tyr Met Ser Val His Pro Lys Leu 

1250 1255 1260 

Gly Pro Tyr Val Thr Met Ala Gly Lys Met He Gin Asn Met Ser Tyr 
1265 1270 1275 128 

He He Val Met Leu Val Val Thr Leu Leu Ser Phe Gly Leu Ala Arg 

1285 1290 1295 

Gin Ser He Thr Tyr Pro Asp Glu Thr Trp His Trp He Leu Val Arg 

1300 1305 1310 

Asn He Phe Leu Lys Pro Tyr Phe Met Leu Tyr Gly Glu Val Tyr Ala 
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1315 1320 1325 

Asp Glu He Asp Thr Cys Gly Asp Giu Ala Trp Asp Gin His Leu Glu 

1330 1335 1340 

Asn Gly Gly Pro Val He Leu Gly Asn Gly Thr Thr Gly Leu Ser Cys 
1345 1350 1355 136 

Val Pro Gly Tyr Trp He Pro Pro Leu Leu Met Thr Phe Phe Leu Leu 

1365 1370 1375 

He Ala Asn He Leu Leu Met Ser Met Leu He Ala He Phe Asn His 

1380 1385 1390 

He Phe Asp Ala Thr Asp Glu Met Ser Gin Gin He Trp Leu Phe Gin 

1395 1400 1405 

Arg Tyr Lys Gin Val Met Glu Tyr Glu Ser Thr Pro Phe Leu Pro Pro 

1410 1415 1420 

Pro Leu Thr Pro Leu Tyr His Gly Val Leu He Leu Gin Phe Val Arg 
1425 1430 1435 144 

Thr Arg Leu Ser Cys Ser Lys Ser Gin Glu Arg Asn Pro He Leu Leu 

1445 1450 1455 

Leu Lys He Ala Giu Leu Phe Leu Asp Asn Asp Gin He Glu Lys Leu 

1460 1465 1470 

His Asp Phe Glu Glu Asp Cys Met Glu Asp Leu Ala Arg Gin Lys Leu 

1475 1480 1485 

Asn Glu Lys Asn Thr Ser Asn Glu Gin Arg He Leu Arg Ala Asp He 

1490 1495 1500 

Arg Thr Asp Gin He Leu Asn Arg Leu He Asp Leu Gin Ala Lys Glu 
1505 1510 1515 152 

Ser Met Gly Arg Asp Val He Asn Asp Val Glu Ser Arg Leu Ala Ser 

1525 1530 1535 

Val Glu Lys Ala Gin Asn Glu He Leu Glu Cys Val Arg Ala Leu Leu 

1540 1545 1550 

Asn Gin Asn Asn Ala Pro Thr Ala He Gly Arg Cys Phe Ser Pro Ser 

1555 1560 1565 

Pro Asp Pro Leu Val Glu Thr Ala Asn Gly Thr Pro Gly Pro Leu Leu 

1570 1575 1580 

Leu Lys Leu Pro Gly Thr Asp Pro He Leu Glu Glu Lys Asp His Asp 
1585 1590 1595 160 

Ser Gly Glu Asn Ser Asn Ser Leu Pro Pro Gly Arg He Arg Arg Asn 

1605 1610 1615 

Arg Thr Ala Thr He Cys Gly Gly Tyr Val Ser Glu Glu Arg Asn Met 

1620 1625 1630 

Met Leu Leu Ser Pro Lys Pro Ser Asp Val Ser Gly He Pro Gin Gin 

1635 1640 1645 

Arg Leu Met Ser Val Thr Ser Met Asp Pro Leu Pro Leu Pro Leu Ala 

1650 1655 1660 

Lys Leu Ser Thr Met Ser He Arg Arg Arg His Glu Glu Tyr Thr Ser 
1665 1670 1675 168 

He Thr Asp Ser He Ala He Arg His Pro Glu Arg Arg He Arg Asn 

1685 1690 1695 

Asn Arg Ser Asn Ser Ser Glu His Asp Glu Ser Ala Val Asp Ser Glu 

1700 1705 1710 

Gly Gly Gly Asn Val Thr Ser Ser Pro Arg Lys Arg Ser Thr Arg Asp 

1715 1720 1725 

Leu Arg Met Thr Pro Ser Ser Gin Val Glu Glu Ser Thr Ser Arg Asp 

1730 1735 1740 

Gin He Phe Glu He Asp His Pro Glu His Glu Glu Asp Glu Ala Gin 
1745 1750 1755 176 

Ala Asp Cys Glu Leu Thr Asp Val He Thr Glu Glu Glu Asp Glu Glu 

1765 1770 1775 

Glu Asp Asp Glu Glu Asp Asp Ser His Glu Arg His His He His Pro 

1780 1785 1790 

Arg Arg Lys Ser Ser Arg Gin Asn Arg Gin Pro Ser His Thr Leu Glu 
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1795 1800 1805 

Thr Asp Leu Ser Glu Gly Glu Giu Val Asp Pro Leu Asp Val Leu Lys 

1810 1815 1820 

Met Lys Glu Leu Pro lie lie His Gin lie Leu Asn Glu Glu Glu Gin 
1825 1830 1835 184 

Ala Gly Ala Pro His Ser Thr Pro Val lie Ala Ser Pro Ser Ser Ser 

1845 1850 1855 

Arg Ala Asp Leu Thr Ser Gin Lys Cys Ser Asp Val 
1860 1865 

<210> 16 

<211> 489 

<212> DNA 

<213> Mus Musculus 



<400 
ccctgaaaga 
ccggggctgt 
ggctgcaggc 
agaatgtccc 
ataccaagct 
gtcagatgtt 
atgaaatact 
gtggaaaagc 
ggttctcat 



> 16 
ctcgacttct 
ccttgttcgg 
cgcggaggtg 
agaaatcctg 
ccaaagaccc 
tctgtggtcg 
cagatgtgaa 
acacggagca 



gctgctagcg 
tggcccgtgc 
gaggaggagc 
gatagagagc 
tcacagatgt 
tttggtcaag 
attgggtgaa 
gagcccaaca 



ctggagctga 
caccgcctcc 
cgctgccctt 
actttgacca 
cttccaggat 
caacatgcat 
cactttaacc 
gatgcttatg 



gttagttttg 
ggagacgctt 
ccggagtccg 
agagggagtg 
gtcagatttg 
gctttactgc 
aggcaataga 
gagtcatcaa 



agaaggtttc 
tccgatagat 
ccccgtgagg 
tgtatatatt 
tcagcaactt 
aagtcttgcc 
agaatggtct 
ttttcaaggg 



60 
120 
180 
240 
300 
360 
420 
480 
489 



<210> 17 
<211> 102 
<212> PRT 

<213> Mus Musculus 





<400> 


17 


Met 


Ser 


Gin 


Lys 


1 








Val 


Tyr 


He 


He 








20 


Cys 


Gin 


He 


Cys 






35 




Lys 


Gin 


His 


Ala 




50 






Val 


Lys 


Leu 


Gly 


65 








Glu 


Lys 


His 


Thr 


Phe 


Gin 


Gly 


Gly 



100 



Ser Trp He Glu 
5 

Pro Ser Ser Lys 

Gin Gin Leu Val 
40 

Cys Phe Thr Ala 
55 

Glu His Phe Asn 
70 

Glu Gin Ser Pro 
85 

Ser His 



Ser Thr Leu Thr 
10 

Asp Pro His Arg 

25 

Arg Cys Phe Cys 

Ser Leu Ala Met 
60 

Gin Ala He Glu 
75 

Thr Asp Ala Tyr 
90 



Lys Arg Glu Cys 
15 

Cys Leu Pro Gly 

30 

Gly Arg Leu Val 
45 

Lys Tyr Ser Asp 

Glu Trp Ser Val 
80 

Gly Val He Asn 
95 



<210> 18 
<211> 410 
<212> DNA 

<213> Homo Sapiens 
<220> 

<221> unsure 
<222> (6)... (6) 

<221> unsure 
<222> (58) . . . (58) 



<221> unsure 
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<221> unsure 

<222> (406) . . . (406) 

<400> 18 

gccgcnggag cctgagcgga gggtgtgcgc agcctcgcca gcgggggccc cgggctgngc 60 

cattgcctca ctgagccagc gcctgcctnc tacctcgccg acagctggaa ccagtgcgac 120 

ctagtggctc rcacctgctt cctcctgggc gtgggctgcc ggctgacccc gggtttgtac 180 

cacctgggcc gcactgtcct ctgcatcgac ttcatggttt tcacggtgcg gctgcttcac 240 

atcttcacgg tcaacaaaca gctggggccc aagatcgtca tcgtgagcaa gatgatgaag 300 

gacgtgttct tcttcctctt cttcctcggc gtgtggctgg tagctatggg ttgggccacg 360 

gaggggttcc tgaggccacg ggacagtgac ttcccaagta tcctgncgcc 410 

<210> 19 
<211> 131 
<212> PRT 

<213> Homo Sapiens 
<220> 

<221> UNSURE 
<222> (15) . . . (15) 
<223> UNKNOWN 

<221> UNSURE 
<222> (25) . . . (25) 
<223> UNECNOWN 

<221> UNSURE 

<222> (131) . . . (131) 

<223> UNKNOWN 

<400> 19 



Ala 


Glu 


Gly 


Val 


Arg 


Ser 


Leu 


Ala 


Ser 


Gly 


Gly 


Pro 


Gly 


Leu 


Xaa 


His 


1 






5 










10 










15 




Cys 


Leu 


Thr 


Glu 


Pro 


Ala 


Pro 


Ala 


Xaa 


Tyr 


Leu 


Ala 


Asp 


Ser 


Trp 


Asn 








20 










25 










30 






Gin 


Cys 


Asp 


Leu 


Val 


Ala 


Leu 


Thr 


Cys 


Phe 


Leu 


Leu 


Gly 


Val 


Gly 


Cys 






35 










40 










45 








Arg 


Leu 


Thr 


Pro 


Gly 


Leu 


Tyr 


His 


Leu 


Gly 


Arg 


Thr 


Val 


Leu 


Cys 


He 




50 










55 










60 










Asp 


Phe 


Met 


Val 


Phe 


Thr 


Val 


Arg 


Leu 


Leu 


His 


He 


Phe 


Thr 


Val 


Asn 


65 










70 










75 










80 


Lys 


Gin 


Leu 


Gly 


Pro 


Lys 


He 


Val 


He 


Val 


Ser 


Lys 


Met 


Met 


Lys 


Asp 








85 










90 










95 




Val 


Phe 


Phe 


Phe 


Leu 


Phe 


Phe 


Leu 


Gly 


Val 


Trp 


Leu 


Val 


Ala 


Met 


Gly 








100 










105 










110 






Trp 


Ala 


Thr 


Glu Gly 


Phe 


Leu 


Arg 


Pro 


Arg 


Asp 


Ser 


Asp 


Phe 


Pro 


Ser 



115 120 125 



He Leu Xaa 
130 

<210> 20 

<211> 389 

<212> DNA 

<213> Homo Sapiens 

<400> 20 

caaatttttt gttagtacac catctcatcc aaattgcaaa agtcacatgg aaactggaac 60 
caaagatcaa gaaactgttt gctctaaagc tacagaagga gataatacag aatttggagc 120 
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atttgtagga cacagagata gcatggat-t acagaggttt aaagaaacat caaacaagat 180 

aaaaatacta tccaataaca atacttctga aaacactttg aaacgagtga gttctcttgc 240 

tggatttact gactgtcaca gaacttccat tcctgttcat tcaaaacgag aaaagatcag 300 

tagaaggcca tctaccgaag acactcatga agtagattcc aaagcagctt taataccggt 360 

ttgtagattt caactaaaca gatatatat 389 



<210> 21 

<211> 415 

<212> DNA 

<213> Homo Sapiens 



<400> 21 

atttctagtt tttcaaattt gccagtcttt ttgaatagta tctccttctt ttctcatgtt 60 

ttatatttaa aactttttta tgtccatcat cactttaaac atacttattt tgtcatctat 120 

aaccaataat tccactatct tatcagaaat caaataccgt ttatgtaagt tgactcccat 180 

gagttctaaa ttgccattgt gaggtcatct tcggttaggc tttaatttgt tgcaaagttg 240 

tgcagctcag ggtcaggaag agtccctcca gaaaggagga tttgttactg tgaatctctt 300 

tgttaactaa cctctttccc cactgaaata acttttttca ataacatgat tttaacaaca 360 

taatctctct atgccagaac agatatatat gaatgtaagt caatattttc ttgag 415 



<210> 22 

<211> 405 

<212> DNA 

<213> Mus Musculus 



tgcctcctcc tcttatcatc ctcagccata 60 

gaagaaagaa agataagact tccgatgggc 120 

agaaactcca tgattttgaa gagcagtgtg 180 

aattcaattc tgggagtgaa gagagaatcc 240 

gcattcagat taaagaagtt ggagatcgtg 300 

tagattctca aattggtcat ctgcaagatc 360 

cacttacagc ccaga 405 

<210> 23 

<211> 5117 

<212> DNA 

<213> Homo Sapiens 

<220> 

<221> unsure 

<222> (2382) . . . (2382) 

<223> unknown 

<221> unsure 

<222> (4664) . . . (4664) 

<223> unknown 

<221> unsure 

<222> (4682) . . . (4682) 

<223> unknown 

<221> unsure 

<222> (4702) . . . (4702) 

<223> unknown 



<400> 22 

ttattatggc ttatcatgaa aaaccagtcc 
tagtttcact gttttgctgt gtatgcaaaa 
caaaactttt: cttaacagaa gaagatcaaa 
ttgagatgta ctttgatgag aaagatgaca 
gggtcacttt tgaaagagtg gagcagatga 
tcaactacat aaaaagatca ttacagtctt 
tctcagccct aacagtagat acattgaaaa 



<221> unsure 

<222> (5038) ... (5039) 

<223> unknown 
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<221> unsure 

<222> (5056) . . . (5056) 

<223> un)cnown 

<221> unsure 

<222> (5071) . . , (5072) 

<400> 23 

gatggcaaca tggtgaagaa tcaatggcta aagcattagt tgcctgtaag atctatcgtt 60 

caatggcata tgaagcaaag cagagtgacc tggtagatga tacttcagaa gaactaaaac 120 

agtattccaa tgattttggt cagtrggccg ttgaattatt agaacagtcc ttcagacaag 180 

atgaaaccat ggctatgaaa ttgctcactt atgaactgaa gaactggagt aattcaacct 240 

gccttaagtt agcagtttct tcaagactra gaccttttgt agctcacacc tgtacacaaa 300 

tgttgttatc tgatatgtgg atgggaaggc tgaatatgag gaaaaattcc tggtacaagg 360 

tcatactaag cattttagtt ccacctgcca tatrgctgtt agagtataaa actaaggctg 420 

aaatgtccca tatcccacaa tctcaagatg ctcatcagat gacaatggat gacagcgaaa 4 80 

acaactttca gaacataaca gaagagatcc ccatggaagt gtttaaagaa gtacggattt 54 0 

tggatagtaa tgaaggaaag aatgagatgg agatacaaat gaaatcaaaa aagcttccaa 600 

ttacgcgaaa gttttatgcc ttttatcatg caccaattgt aaaattctgg tttaacacgt 660 

tggcatattt aggatttctg atgctttata catttgtggt tcttgtacaa atggaacagt 720 

taccttcagt tcaagaatgg attgttattg cttatatttt tacttatgcc attgagaaag 780 

tccgtgagat ctttatgtct gaagctggga aagtaaacca gaagattaaa gtatggttta 84 0 

gtgattactt caacatcagt gatacaattg ccataatttc tttcttcatt ggatttggac 900 

taagatttgg agcaaaatgg aactttgcaa atgcatatga taatcatgtt tttgtggctg 960 

gaagattaat ttactgtctt aacataatat tttggtatgt gcgtttgcta gattttctag 1020 

ctgtaaatca acaggcagga ccttatgtaa tgatgattgg aaaaatggtg gccaatatgt 108 0 

tctacattgt agtgattatg gctcttgtat tacttagttt tggtgttccc agaaaggcaa 114 0 

tactttatcc tcatgaagca ccatcttgga ctcttgctaa agatatagtt tttcacccat 1200 

actggatgat ttttggtgaa gtttatgcat acgaaattga tgtgtgtgca aatgattctg 1260 

ttatccctca aatctgtggt cctgggacgt ggttgactcc atttcttcaa gcagtctacc 1320 

tctttgtaca gtatatcatt atggttaatc ttcttattgc atttttcaac aatgtgtatt 1380 

tacaagtgaa ggcaatttcc aatattgtat ggaagtacca gcgttatcat tttattatgg 1440 

cttatcatga gaaaccagtt ctgcctcctc cacttatcat tcttagccat atagtttctc 1500 

tgttttgctg catatgtaag agaagaaaga aagataagac ttccgatgga ccaaaacttt 1560 

tcttaacaga agaagatcaa aagaaacttc atgattttga agagcagtgt gttgaaatgt 1620 

atttcaatga aaaagatgac aaatttcatt ctgggagtga agagagaatt cgtgtcact.t 1680 

ttgaaagagt ggaacagatg tgcattcaga ttaaagaagt tggagatcgt gtcaactaca 1740 

taaaaagatc attacaatca ttagattctc aaattggcca tttgcaagat ctttcagccc 1800 

tgacggtaga tacattaaaa acactcactg cccagaaagc gtcggaagct agcaaagttc 1860 

ataatgaaat cacacgagaa ctgagcattt ccaaacactt ggctcaaaac cttattgatg 1920 

atggtcctgt aagaccttct gtatggaaaa agcatggtgt tgtaaataca cttagctcct 1980 

ctcttcctca aggtgatctt gaaagtaata atccttttca ttgtaatatt ttaatgaaag 2040 

atgacaaaga tccccagtgt aatatatttg gtcaagactt acctgcagta ccccagagaa 2100 

aagaatttaa ttttccagag gctggttcct cttctggtgc cttattccca agtgctgttt 2160 

cccctccaga actgcgacag agactacatg gggtagaact cttaaaaata tttaataaaa 2220 

atcaaaaatt aggcagttca tctactagca taccacatct gtcatcccca ccaaccaaat 2280 

tttttgttag tacaccatct cagccaagtt gcaaaagcca cttggaaact ggaaccaaag 234 0 

atcaagaaac tgtttgctct aaagctacag aaggagataa tncagaattt ggagcatttg 2400 

taggacacag agatagcatg gatttacaga ggtttaaaga aacatcaaac aagataaaaa 24 60 

tactatccaa taacaatact tctgaaaaca ctttgaaacg agtgagttct cttgctggat 2520 

ttactgactg tcacagaact tccattcctg ttcattcaaa acaagcagaa aaaatcagta 2580 

gaaggccatc taccgaagac actcatgaag tagattccaa agcagcttta ataccggatt 264 0 

ggttacaaga tagaccatca aacagagaaa tgccatctga agaaggaaca ttaaatggtc 2700 

tcacttctcc atttaagcca gctatggata caaattacta ttattcagct gtggaaagaa 27 60 

ataacttgat gaggttatca cagagcattc catttacacc tgtgcctcca agaggggagc 2820 

ctgtcacagt gtatcgtttg gaagagagtt cacccaacat actaaataac agcatgtctt 288 0 

cttggtcaca actaggcctc tgtgccaaaa tagagttttt aagcaaagag gagatgggag 294 0 

gaggtttacg aagagctgtc aaagtacagt gtacctggtc agaacatgat atcctcaaat 3000 

cagggcatct ttatattatc aaatcttttc ttccagaggt ggttaataca tggtcaagta 3060 

tttataaaga agatacagtt ctgcatctct gtctgagaga aattcaacaa cagagagcag 3120 
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cacaaaagct 
ggttccttga 
aatgtatgac 
ctaatactct 
gggagttact 
taaaagcaga 
atgcaattaa 
ttccagatct 
cttcagattt 
ttcgtctgat 
aatgttactg 
gcaagttgaa 
atcagtcaga 
taattagtat 
tgttttgatg 
gtccatgcag 
cacgcagcct 
tgatgatatc 
tctaattcta 
cggtatttat 
atagaaccca 
actgttaagt 
acctaactca 
atctgtatct 
tatatcaaat 
ttgaacatga 
anactggcag 
ttgttgaata 
ttttcttttt 
gggagcagta 
gagatgataa 
tgtattactt 
ataaatttat 
gatccctcga 



tacgtttgcc 
agttttcctg 
tggagaattt 
ggaagagatc 
ggtacttgat 
agaaaagaga 
aaacttcaga 
gaagaggaat 
gaatcttcag 
gttataatat 
tgtcactttt 
tttgctgact 
aagtccctat 
ttgtaacttt 
agggtgatgc 
attttattgg 
tttggttaaa 
actggttccc 
ggttcacaga 
gaatcagggt 
aattgtgtaa 
agaagaaagc 
ctcgtcaagg 
atagcaacaa 
cagaatgtaa 
atgctgtttg 
attgcagtgc 
ttcaacaata 
ttttcttttt 
taaagcttat 
gctgtttaaa 
aaagatacta 
tgtttncaaa 
ggggccc 



tttaatcaaa 
ctgtattgcc 
agaaaataca 
atgctagcct 
ttgcaaggtg 
tcctgtgata 
gcaaaacatc 
gattatacgc 
cctggaaatt 
taatattact 
ccctcgggag 
ccagcacagt 
aggatagagc 
ttaaagggct 
cctctcttat 
cagctgtttt 
tggcagtcaa 
aactagatgc 
ggtatgagaa 
taggctccat 
aaatgttaat 
catggtaaac 
gaagttacct 
cccagaactt 
acatgaactc 
ggtacttaag 
anccttatgc 
acaagaaaag 
cttttgtttc 
ttgtatttag 
tagtgtttaa 
tatacgtttt 
aaaaaaaccc 
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tgaaacccaa 
attcagcagg 
acaataataa 
ttagccactg 
ttggtgaaaa 
tggtttttgg 
actgtaattc 
ctgataaaat 
ccaccaaaga 
gaatcattgg 
gaaattgttt 
taaaaggtca 
tggcagctga 
ctttgtatag 
gtggtacaat 
attgccattc 
aattttcctc 
ttgttggcca 
gcctgaactg 
atttaaagat 
tgggtttttt 
tgatacataa 
tttgaggaaa 
acaaacttct 
ttgcatatat 
aaattrattc 
caataaaatg 
cttttcatct 
cttggtacta 
tagtgtatct 
tattgattgg 
gtatatcatt 
nntaaaaaaa 



atccatacca 
acagtggttt 
tggagatgag 
gacttacgaa 
tttgactgac 
cccagcaaat 
ttgctgtaga 
tatatttcct 
at cagaatca 
ttttgcctgc 
ggtaatatag 
atattctttt 
gaaattttaa 
cagaggatct 
accattaacc 
aactagggaa 
agtgtattta 
cgggaaggga 
aagaccattt 
agagccagtt 
aaacattgtt 
cctaaattat 
gttaaagtac 
ccaaagattt 
ttaaaattgt 
agtnggatta 
taatttaaca 
aagttttatg 
attttaattt 
catagataca 
gggtggggag 
aaatctttaa 
aaagggcggc 



tattctccaa 
gctgtggaag 
attattccaa 
tatacaagag 
ccatctgtga 
ctaggagaag 
aagcttaaac 
caggatgagc 
gctaattctg 
acctcacaga 
aaaggtgtat 
gacctgatta 
aggtaattga 
catttgactt 
aaaggtaggt 
atgaagaaat 
gtgtgttcag 
aatgacttgt 
tcaagaggga 
ttttttttaa 
ttatcaagtc 
aaaagcagaa 
ttttttccct 
tattgattgt 
gttggaacat 
tcattatgtg 
gccccagata 
ctttaatttt 
ttatttggaa 
gacaaggcaa 
aaagaaaaag 
aagaaatnna 
ccctctagag 



3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5117 



<210> 24 
<211> 1224 
<212> PRT 

<213> Homo Sapiens 
<220> 

<221> UNSURE 

<222> (794) . . . (794) 

<223> UNKNOWN 



<400> 24 



Trp 


Gin 


His 


Gly 


Glu 


Glu 


Ser 


Met 


Ala 


Lys 


Ala 


Leu 


Val 


Ala 


Cys 


Lys 


1 






5 










10 










15 




lie 


Tyr 


Arg 


Ser 


Met 


Ala 


Tyr 


Glu 


Ala 


Lys 


Gin 


Ser 


Asp 


Leu 


Val 


Asp 




20 










25 










30 






Asp 


Thr 


Ser 


Glu 


Glu 


Leu 


Lys 


Gin 


Tyr 


Ser 


Asn 


Asp 


Phe 


Gly 


Gin 


Leu 




35 










40 










45 








Ala 


Val 
50 


Glu 


Leu 


Leu 


Glu 


Gin 
55 


Ser 


Phe 


Arg 


Gin 


Asp 
60 


Glu 


Thr 


Met 


Ala 


Met 


Lys 


Leu 


Leu 


Thr 


Tyr 


Glu 


Leu 


Lys 


Asn 


Trp 


Ser 


Asn 


Ser 


Thr 


Cys 


65 








70 










75 










80 


Leu 


Lys 


Leu 


Ala 


Val 


Ser 


Ser 


Arg 


Leu 


Arg 


Pro 


Phe 


Val 


Ala 


His 


Thr 








85 










90 










95 
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Cys 


1 nr 


fZ^ rt 
O J. li 


Met 


Leu 


Leu 


Ser 


Asp 


Met 


Tro 


Met 


Gly Arg 


Leu 


Asn 


Met 






100 










105 










110 








jjys 


Asn 


Ser 




1 yx 


Lys 


Val 


He 


Leu 


Ser 


He 


Leu 


Val 


Pro 


Pro 


lis 










120 










125 










lie 


Leu 


Leu 


Leu 


Glu 


T vr 
1 yx 


Lys 


Thr 


Lys 


Ala 


Glu 


Met 


Ser 


His 


He 




uu 










135 




















fro 




C A r* 

oer 




Asp 


Ala 


His 


Gin 


Met 


Thr 


Met 


Asp 


Asp 


Ser 


Glu 


Asn 










150 










155 










160 


Asn 


Phe 


Gin 


Asn 


i±e 


i ilx 


UX li 


Glu 


He 


Pro 


Met 


^^1 11 

OX u 


Val 
Vox 


Phe 


Lys 


Glu 










X 










170 










175 




Val 


Arg 


lie 


Leu 


Asp 


Cat* 

oer 


Asn 


UX u 


va X y 


Lys 


Asn 


r:i 11 

UXU 




Glu 


He 


Gin 






1 Rn 










185 










190 






Met 


Lys 


Ser 


Lys 


Lys 


Leu 


Pro 


He 


Thr 


- 


Lys 


PVk A 


Tyr 


Ala 


Phe 


Tvr 




ly o 










200 


















His 


Ala 


Pro 


i±e 


vai 


Lys 




Trp 


Phe 


Asn 


Thr 


Leu 


m a 

nXo 


1 yx 


Leu 


Gly 




210 










^ X «J 










o o n 










Phe 


Leu 


Met 


Leu 


Tyr 


1 nr 


fne 


Vox 


Vox 


L u 
eu 


Val 


oin 


ne I. 


Glu 


Gin 


Leu 


225 


















235 










240 


Pro 


Ser 


Val 


Gin 


Glu 


Trp 


T 1 a 

lie 


vai 


lie 


ai a 

M.Xa 


Tyr 


lie 


irne 


X IIX 


Tyr 


Ala 




















250 










255 




lie 


Glu 


Lys 


vax 


Arg 


IjxU 


T 1 A 

1 xe 


It ne 




OCX 


uxu 


AJ.a 


oiy 


Lys 


Val 


Asn 
















^ o ^ 










270 






Gin 


Lys 


He 


Lys 


vax 


Trp 


fne 


C a -r- 


Asp 


Tyr 




Asn 


T 1 irk 
XX© 


Ser 


Asp 


Thr 




Z f o 










280 










O D 








lie 


Ala 


He 


Tin 

lie 


Ser 


fne 


IT ne 


X xe 


uxy 


Phe 


Gly 


Leu 


Arg 


Phe 


Gly Ala 




290 






























Lys 


Trp 


Asn 


Pne 


Ai.a 


Asn 


nXo 


Tyr 


Asp 


sn 


His 


vai 


PHa 

irne 


Val 


Ala 


Gly 


305 


















315 










320 


Arg 


Leu 


He 


Tyr 


Cys 


Leu 


Asn 


T 1 o 

X xe 


T 1 o 
XX6 


It 116 


Trp 


Tyr 


Vox 


Arg 


Leu 


Leu 


















330 










335 




Asp 


Pne 


Leu 


Aia 


vai 


Asn 


m n 


oxf i 


nXO 


Gly 


Pro 


Tyr 


Va 1 
Vox 


Met 


Met 


He 






4 n 










345 










350 






1 »» 


Lys 


me I. 


Vox 




sn 


Met 


Phe 


J. yi 


He 


Val 


\7a 1 
Vox 


Tl o 
xxe 


Met 


Ala 


Leu 












360 


















vax 


Leu 


Leu 


oer 


t ne 


uxy 


Va 1 
Vox 


Pro 


rg 


- 

ys 


Ala 


xxe 


Leu 


Tvr 

X y J. 


Pro 


His 




"3 T n 
J / U 
































Axa 


Pro 


oer 


Trp 


X iiX 


Leu 


Ala 


Lys 


Asp 


He 


VaX 


Phe 
XT lie 


His 


Pro 


Tyr 


385 










390 










395 










400 


Trp 


Met 


1 X e 


irne 




oXU 


V d X 


1 yi: 


Ala 


i yx 


Glu 


Tie 

xxe 


Asp 


Val 


Cys 


Ala 








405 










410 










415 




Asn 


Asp 


ber 


vax 


1 xe 


Pro 


VnXIl 


X Xc= 


Cys 


Gly 


Pro 


ox y 


Thr 


Tm 


Leu 


Thr 






4 9 n 

^ w 










425 










430 






Pro 


fne 


Leu 


oxn 


Zi 1 a 
MX a 


V aX 


T V7T" 

L yx 


Leu 


Phe 


Val 


Gin 


Tyr 


He 


He 


Met 


Val 






435 










4 4 0 










4 45 








Asn 


Leu 


Leu 


X X€ 


MX a 


Phe 


Phe 


Asn 


As n 


Val 


Tvr 

i. y j- 


Leu 


Gin 


Val 


Lys 


Ala 




450 










455 




















T 1 ^ 


oer 


As n 


xxe 


V dX 


Trp 


Lys 


1 yx 


Gin 


Arg 


J. y X 


nx o 


Phe 


He 


Met 


Ala 


H OO 










470 










475 










480 


Tyr 




Pill 

uX u 


Lys 


Pro 


Val 


Leu 


Pro 


Pro 


Pro 


Leu 


He 


He 


Leu 


Ser 


His 






485 










490 










4 95 




He 


Val 


Ser 


Leu 


Phe 


Cys 


Cys 


He 


Cys 


Lys 


Arg 


Arg 


Lys 


Lys 


Asp 


Lys 








500 










505 










510 






Thr 


Ser 


Asp 


Gly 


Pro 


Lys 


Leu 


Phe 


Leu 


Thr 


Glu 


Glu 


Asp 


Gin 


Lys 


Lys 






515 










520 










525 








Leu 


His 


Asp 


Phe 


Glu 


Glu 


Gin 


Cys 


Val 


Glu 


Met 


Tyr 


Phe 


Asn 


Glu 


Lys 




530 








535 










540 










Asp 


Asp 


Lys 


Phe 


His 


Ser 


Gly 


Ser 


Glu 


Glu 


Arg 


He 


Arg 


Val 


Thr 


Phe 


545 








550 










555 










560 


Glu 


Arg 


Val 


Glu 


Gin 


Met 


Cys 


He 


Gin 


He 


Lys 


Glu 


Val 


Gly Asp 


Arg 








565 










570 










575 
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Val 


Asn 


Tyr 


He 


Lys 


Arg 


Ser 


Leu 


k3±n 


Ser 


Leu 


Asp 


Ser 


{^1 n 
O-L 1 1 


T 1 ea 

i. J.e 










c. o r\ 










Do O 
















His 


Leu 


Gin 


Asp 


Leu 


Ser 


Ala 


Leu 


Thr 


vaj. 


Asp 


Thr 


Leu 


Lys 


1 nr 


Leu 






595 










OUU 










605 








Thr 


Ala 


Gin 


Lys 


Ala 


Ser 


Glu 


Ala 


Ser 


Lys 


Val 


His 


Asn 


oXU 


1 J.e 


1 nr 




610 










615 










620 










Arg 


Glu 


Leu 


Ser 


He 


Ser 


Lys 


His 


Leu 


Ala 


Gin 


Asn 


Leu 


ixe 


TV 

Asp 


Asp 


625 










630 










635 












Gly 


Pro 


Val 


Arg 


Pro 


Ser 


Val 


Trp 


Lys 


Lys 


His 


Gly Val 


Val 


Asn 


Thr 










645 










650 










ODD 




Leu 


Ser 


Ser 


Ser 


Leu 


Pro 


Gin 


Gly 


Asp 


Leu 


Glu 


Ser 


Asn 


Asn 


Pro 


Phe 








660 










665 










670 






His 


Cys 


Asn 


He 


Leu 


Met 


Lys 


Asp 


Asp 


Lys 


Asp 


Pro 


Gin 


Cys 


Asn 


He 






675 










680 










685 








Phe 


Gly 


Gin 


Asp 


Leu 


Pro 


Ala 


Val 


Pro 


Gin 


Arg 


Lys 


Glu 


Phe 


Asn 


Phe 




690 










695 










700 










Pro 


Glu 


Ala 


Gly 


Ser 


Ser 


Ser 


Gly 


Ala 


Leu 


Phe 


Pro 


Ser 


Ala 


Val 


Ser 


705 










710 










715 










-7 O A 


Pro 


Pro 


Glu 


Leu 


Arg 


Gin 


Arg 


Leu 


His 


Gly 


Val 


Glu 


Leu 


Leu 


Lys 


He 










725 










730 










735 




Phe 


Asn 


Lys 


Asn 


Gin 


Lys 


Leu 


Gly 


Ser 


Ser 


Ser 


Thr 


Ser 


He 


Pro 


His 








740 










745 










750 






Leu 


Ser 


Ser 


Pro 


Pro 


Thr 


Lys 


Phe 


Phe 


Val 


Ser 


Thr 


Pro 


Ser 


Gin 


Pro 






755 










760 










765 








Ser 


Cys 


Lys 


Ser 


His 


Leu 


Glu 


Thr 


Gly 


Thr 


Lys 


Asp 


Gin 


Glu 


Thr 


Val 




770 










775 










780 










Cys 


Ser 


Lys 


Ala 


Thr 


Glu 


Gly 


Asp 


Asn 


Xaa 


Glu 


Phe 


Gly Ala 


Phe 


Val 


785 










790 










7 95 










Q r\ r\ 


Gly His 


Arg 


Asp 


Ser 


Met 


Asp 


Leu 


Gin 


Arg 


Phe 


Lys 


Glu 


Thr 


Ser 


Asn 










805 










810 










O 1 c 

o 15 




Lys 


He 


Lys 


He 


Leu 


Ser 


Asn 


Asn 


Asn 


Thr 


Ser 


Glu 


Asn 


Thr 


Leu 


Lys 




820 










825 










830 






Arg Val 


Ser 


Ser 


Leu 


Ala 


Gly 


Phe 


Thr 


Asp 


Cys 


His 


Arg 


Thr 


Ser 


He 






835 










840 










845 








Pro 


Val 


His 


Ser 


Lys 


Gin 


Ala 


Glu 


Lys 


He 


Ser 


Arg 


Arg 


Pro 


Ser 


Thr 




850 










855 










860 










Glu 


Asp 


Thr 


His 


Glu 


Val 


Asp 


Ser 


Lys 


Ala 


Ala 


Leu 


He 


Pro 


Asp 


Trp 


865 










870 










O T C 

8 7 5 










0 0 A 


Leu 


Gin 


Asp 


Arg 


Pro 


Ser 


Asn 


Arg 


Glu 


Met 


Pro 


Ser 


Glu 


Glu 


Gly 


Thr 










885 










Q o r\ 
o y U 










R Q C, 




Leu 


Asn 


Gly 


Leu 


Thr 


Ser 


Pro 


Phe 


Lys 


Pro 


Ala 


Met 


Asp 


Thr 


Asn 


Tyr 








900 










905 










910 






Tyr 


Tyr 


Ser 


Ala 


Val 


Glu 


Arg 


Asn 


Asn 


Leu 


Met 


Arg 


Leu 


Ser 


Cjin 


Ser 






915 










920 










925 








He 


Pro 


Phe 


Thr 


Pro 


Val 


Pro 


Pro 


Arg 


Gly 


Glu 


Pro 


Val 


Thr 


vai 


Tyr 




930 










935 










94 0 










Arg 


Leu 


Glu 


Glu 


Ser 


Ser 


Pro 


Asn 


He 


Leu 


Asn 


Asn 


Ser 


Met 


o er 


Ser 


945 










950 










955 












Trp 


Ser 


Gin 


Leu 


Gly 


Leu 


Cys 


Ala 


Lys 


He 


Glu 


Phe 


Leu 


Ser 


Lys 


Glu 










965 










970 










975 




Glu 


Met 


Gly 


Gly 


Gly 


Leu 


Arg 


Arg, 


Ala 


Val 


Lys 


Val 


Gin 


Cys 


Thr 


Trp 








980 










985 










990 






Ser 


Glu 


His 


Asp 


He 


Leu 


Lys 


Ser 


Gly 


His 


Leu 


Tyr 


He 


He 


Lys 


Ser 






995 










1000 








1005 






Phe 


Leu 


Pro 


Glu 


Val 


Val 


Asn 


Thr 


Trp 


Ser 


Ser 


He 


Tyr 


Lys 


Glu 


Asp 




1010 








1015 








1020 








Thr 


Val 


Leu 


His 


Leu 


Cys 


Leu 


Arg 


Glu 


He 


Gin 


Gin 


Gin 


Arg 


Ala 


Ala 


1025 








1030 








1035 








104 


Gin 


Lys 


Leu 


Thr 


Phe 


Ala 


Phe 


Asn 


Gin 


Met 


Lys 


Pro 


Lys 


Ser 


He 


Pro 








1045 








1050 








1055 
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Tyr Ser Pro Arg Phe Leu Glu Val Phe Leu Leu Tyr Cys His Ser Ala 

1060 1065 1070 

Gly Gin Tro Phe Ala Val Glu Glu Cys Met Thr Gly Glu Phe Arg Lys 

1075 1080 1085 

Tyr Asn Asn Asn Asn Gly Asp Glu lie lie Pro Thr Asn Thr Leu Glu 

1090 1095 1100 

Glu lie Met Leu Ala Phe Ser His Trp Thr Tyr Glu Tyr Thr Arg Gly 
1105 1110 1115 112 

Glu Leu Leu Val Leu Asp Leu Gin Gly Val Gly Glu Asn Leu Thr Asp 

1125 1130 1135 

Pro Ser Val He Lys Ala Glu Glu Lys Arg Ser Cys Asp Met Val Phe 

1140 1145 1150 

Gly Pro Ala Asn Leu Gly Glu Asp Ala He Lys Asn Phe Arg Ala Lys 

1155 1160 1165 

His His Cys Asn Ser Cys Cys Arg Lys Leu Lys Leu Pro Asp Leu Lys 

1170 1175 1180 

Arg Asn Asp Tyr Thr Pro Asp Lys He He Phe Pro Gin Asp Glu Pro 
1185 1190 1195 120 

Ser Asp Leu Asn Leu Gin Pro Gly Asn Ser Thr Lys Glu Ser Glu Ser 

1205 1210 1215 

Ala Asn Ser Val Arg Leu Met Leu 
1220 



<210> 25 

<211> 2180 

<212> DNA 

<213> Homo Sapiens 



<400> 25 

tcgaggccaa gaattcggca cgagggcctc gggcaggccc cctggagcga cctgcttctt 60 

tgggcactgt tgctgaacag ggcacagatg gccatgtact tctgggagat gggttccaat 120 

gcagtttcct cagctcttgg ggcctgtttg ctgctccggg tgatggcacg cctggagcct 18 0 

gacgctgagg aggcagcacg gaggaaagac ctggcgttca agtttgaggg gatgggcgtt 24 0 

gacctctttg gcgagtgcta tcgcagcagt gaggtgaggg ctgcccgcct cctcctccgt 300 

cgctgcccgc tctgggggga tgccacttgc ctccagctgg ccatgcaagc tgacgcccgt 360 

gccttctttg cccaggatgg ggtacagtct ctgctgacac agaagtggtg gggagatatg 420 

gccagcacta cacccatctg ggccctggtt ctcgccttct tttgccctcc actcatctac 480 

acccgcctca tcaccttcag gaaatcagaa gaggagccca cacgggagga gctagagttt 540 

gacatggata gtgtcattaa tggggaaggg cctgtcggga cggcggaccc agccgagaag 600 

acgccgctgg gggtcccgcg ccagtcgggc cgtccgggtt gctgcggggg ccgctgcggg 660 

gggcgccggt gcctacgccg ctggttccac ttctggggcg cgccggtgac catcttcatg 720 

ggcaacgtgg tcagctacct gctgttcctg ctgcttttct cgcgggtgct gctcgtggat 780 

ttccagccgg cgccgcccgg ctccctggag ctgctgctct atttctgggc tttcacgctg 840 

ctgtgcgagg aactgcgcca gggcctgagc ggaggcgggg gcagcctcgc cagcgggggc 900 

cccgggcctg gccatgcctc actgagccag cgcctgcgcc tctacctcgc cgacagctgg 960 

aaccagtgcg acctagtggc tctcacctgc ttcctcctgg gcgtgggctg ccggctgacc 1020 

ccgggtttgt accacctggg ccgcactgtc ctctgcatcg acttcatggt tttcacggtg 1080 

cggctgcttc acatcttcac ggtcaacaaa cagctggggc ccaagatcgt catcgtgagc 1140 

aagatgatga aggacgtgtt cttcttcctc ttcttcctcg gcgtgtggct ggtagcctat 1200 

ggcgtggcca cggaggggct cctgaggcca cgggacagtg acttcccaag tatcctgcgc 1260 

cgcgtcttct accgtcccta cctgcagatc ttcgggcaga ttccccagga ggacatggac 1320 

gtggccctca tggagcacag caactgctcg tcggagcccg gcttctgggc acaccctcct 1380 

ggggcccagg cgggcacctg cgtctcccag tatgccaact ggctggtggt gctgctcctc 144 0 

gtcatcttcc tgctcgtggc caacatcctg ctggtcaact tgctcattgc catgttcagt 1500 

tacacattcg gcaaagtaca gggcaacagc gatctctact ggaaggcgca gcgttaccgc 15 60 

ctcatccggg aattccactc tcggcccgcg ctggccccgc cctttatcgt catctcccac 1620 

ttgcgcctcc tgctcaggca attgtgcagg cgacccsgga gcccccagcc gtcctccccg 168 0 

gccctcgagc atttccgggt ttacctttct aaggaagccg agcggaagct gctaacgtgg 174 0 

gaatcggtgc ataaggagaa ctttctgctg gcacgcgcta gggacaagcg ggagagcgac 1800 

tccgagmgtc tgaagcgcac gtcccagaag gtggacttgg cactgaaaca gctgggacac 1860 
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atccgcgagr acgaacagcg cctgaaagtg 
gcccccgcac ctggtggcct tgtccttgag 
aggaccacct ttgggagtgt catccttaca 
cagtcccagc ctgggaggat caaggcctgg 
agggtccttg gggtaacagg gaccacagac 
ggaaataaag ccatttcaga 
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ctggagcggg aggtccagca gtgtacctcg 1920 
gtgagcccca tgtccatctg ggccactgtc 1980 
aaccacagca tgcccggctc ctcccagaac 2040 
atcccrggcc gttatccatc tggaggctgc 2100 
ccctcaccac tcacagattc ctcacactgg 2160 

2180 



<210> 26 
<211> 725 
<212> PRT 

<213> Homo Sapiens 
<220> 

<221> UNSURE 

<222> (553) . . . (553) 

<223> UNKNOWN 



<221> UNSURE 

<222> (603) . . . (603) 

<223> UNKNOWN 





<400> 


26 


Ser 


Arg 


Pro 


Arg 


1 








Asp 


Leu 


Leu 


Leu 








20 


Tyr 


Phe 


Trp 


Glu 






35 




Cys 


Leu 


Leu 


Leu 




50 






Ala 


Ala 


Arg 


Arg 


65 








Asp 


Leu 


Phe 


Gly 


Leu 


Leu 


Leu 


Arg 








100 


Leu 


Ala 


Met 


Gin 






115 




Gin 


Ser 


Leu 


Leu 




130 






Pro 


lie 


Trp 


Ala 


145 








Thr 


Arg 


Leu 


He 


Glu 


Leu 


Glu 


Phe 








180 


Gly 


Thr 


Ala 


Asp 






195 




Ser 


Gly 


Arg 


Pro 




210 






Leu 


Arg 


Arg 


Trp 


225 








Gly 


A.sn 


Val 


Val 


Leu 


Leu 


Val 


Asp 








260 


Leu 


Tyr 


Phe 


Trp 






275 




Leu 


Ser 


Gly 


Gly 




290 







He Arg His Glu 
5 

Trp Ala Leu Leu 

Met Gly Ser Asn 
40 

Arg Val Met Ala 
55 

Lys Asp Leu Ala 
70 

Glu Cys Tyr Arg 

85 

Arg Cys Pro Leu 

Ala Asp Ala Arg 
120 

Thr Gin Lys Trp 
135 

Leu Val Leu Ala 
150 

Thr Phe Arg Lys 
165 

Asp Met Asp Ser 

Pro Ala Glu Lys 
200 

Gly Cys Cys Gly 
215 

Phe His Phe Trp 
230 

Ser Tyr Leu Leu 
245 

Phe Gin Pro Ala 

Ala Phe Thr Leu 
280 

Gly Gly Ser Leu 
295 



Gly Leu Gly Gin 
10 

Leu Asn Arg Ala 
25 

Ala Val Ser Ser 

Arg Leu Glu Pro 
60 

Phe Lys Phe Glu 
75 

Ser Ser Glu Val 
90 

Trp Gly Asp Ala 
105 

Ala Phe Phe Ala 

Trp Gly Asp Met 
140 

Phe Phe Cys Pro 
155 

Ser Glu Glu Glu 
170 

Val He Asn Gly 
185 

Thr Pro Leu Gly 

Gly Arg Cys Gly 
220 

Gly Ala Pro Val 
235 

Phe Leu Leu Leu 
250 

Pro Pro Gly Ser 
265 

Leu Cys Glu Glu 

Ala Ser Gly Gly 
300 



Ala Pro Trp Ser 
15 

Gin Met Ala Met 
30 

Ala Leu Gly Ala 
45 

Asp Ala Glu Glu 

Gly Met Gly Val 
80 

Arg Ala Ala Arg 
95 

Thr Cys Leu Gin 
110 

Gin Asp Gly Val 
125 

Ala Ser Thr Thr 

Pro Leu He Tyr 
160 

Pro Thr Arg Glu 
175 

Glu Gly Pro Val 
190 

Val Pro Arg Gin 
205 

Gly Arg Arg Cys 

Thr He Phe Met 
240 

Phe Ser Arg Val 
255 

Leu Glu Leu Leu 
270 

Leu Arg Gin Gly 
285 

Pro Gly Pro Gly 
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n J. s 


AX a 


Ser 


Leu 


Ser 


Gin 


Arg 


Leu 


Ara 


Leu 


Tyr 


Leu 


Ala 


Asp 


Ser 


Trp 


305 










310 










315 










320 


Asn 




Cys 


Asp 


Leu 


Val 


Ala 


Leu 


Thr 


Cys 


Phe 


Leu 


Leu 


Gly 


Val 


Gly 






325 










330 










335 




Cys 


ArQ 


Leu 


Thr 


Pro 


Gly 


Leu 


Tvr 


His 


Leu 


Glv 


Ara 


Thr 


Val 


Leu 


Cys 




340 










345 










350 






J. X6 


Asp 


It lis 


Met 


Val 


Phe 


Thr 


Val 


Arg 


Leu 


Leu 


His 


He 


Phe 


Thr 


Val 














360 










365 








Asn 


Lys 




Leu 


Gly 


Pro 


Lys 
375 


He 


Vai 


He 


Val 


Ser 
380 


Lys 


Met 


Met 


Lvs 


Asp 


vai 


t fie 


Phe 


Phe 


Leu 


Phe 


Phe 


Leu 


Glv 

V? J. jr 


Val 


Tro 


Leu 


Val 


Ala 


Tyr 


385 










390 










395 










400 


Gly 


val 


Aia 


Thr 


Glu 










Pro 


Arg 


Asp 


Ser 


Asp 


Phe 


Pro 








405 










4 10 










415 




C A V 


lie 


Leu 


Arg Arg 


Val 
V ax 


Phe 


lyi 


Arg 


Pro 


T vir 
i y i 


Leu 


Gin 


He 


Phe 


Glv 








420 










425 










430 






oxn 


lie 


Pro 

i4 


Gin 


Glu 


Asp 


Met 


As p 
4 4 0 


Val 


Ala 


Leu 


Met 


Glu 
445 


His 


Ser 


Asn 


Cys 


oer 


Ser 


Glu 


Pro 


Gly 


Phe 


J. xp 


Ala 


His 


Pro 


Pro 


Gly Ala 


Gin 


Ala 


450 










455 










460 










\jjLy 


1 nr 


Cys 


Val 


Ser 


Gin 


Tyr 


Ala 


Asn 


Tm 


Leu 


Val 


Val 


Leu 


Leu 


Leu 


465 








470 










475 










480 


Val 


lie 


Phe 


Leu 


Leu 
485 


Val 


Ala 




He 


Leu 
4 90 


Leu 


Val 


Asn 


Leu 


Leu 
4 95 


He 


Ala 


ner 


jrne 


Ser 


Tyr 


Thr 


Phe 


Gly 


Lys 


Val 


Gin 


Glv 


Asn 


Ser 


AsD 


Leu 








500 








505 










510 






Tyr 


Trp 


Lys 


Ala 


Gin 


Arg 


Tyr 


Arg 


Leu 


He 


Arg 


Glu 


Phe 


His 


Ser 


Arg 


515 










520 










525 








Pro 


TV 1 

Ala 


Leu 


Ala 


Pro 


Pro 


Phe 


X Xc 


Val 


He 


Ser 


His 


Leu Arg 


Leu 


Leu 














535 










540 










Leu 


Ar^ 


uin 


Leu 


Cys 


Arg Arg 


Pro 


Xaa 


Ser 


Pro 


Gin 


Pro 


Ser 


Ser 


Pro 


545 








550 










555 










560 


Ala 


T All 

lieu 


ulU 


His 


Phe 
565 


Arg 


Val 


Tyr 


Leu 


Ser 
570 


Lys 


Glu 


Ala 


Glu 


Arg 
575 


Lys 


Leu 


Leu 


1 nr 


Trp 
580 


Glu 


Ser 


Val 


His 


Lys 
585 


Glu 


Asn 


Phe 


Leu 


Leu 
590 


Ala 


Ara 


Til a 


Arg 


Asp 


Lys 


Arg 


Glu 


Ser 


Asp 


Ser 


Glu 


Xaa 


Leu 


Lys 


Arg 


Thr 


Ser 




595 










600 










605 










Lys 

D 1 u 


V ax 


Asp 


Leu 


Ala 


Leu 
615 


Lys 


Gin 


Leu 


Glv 


His 
620 


He 


Arg 


Glu 


Tyr 


d 11 

ox U 


Gin 


Arg 


Leu 


Lys 


Val 


Leu 


Glu 


Arg 


Glu 


Val 


Gin 


Gin 


Cys 


Thr 


Ser 


625 






630 










635 










640 


Ala 


Pro 


Ala 


Pro 


Gly 
645 


Gly 


Leu 


Val 


Leu 


Glu 
650 


Val 


Ser 


Pro 


Met 


Ser 
655 


He 


Trp 


Ala 


Thr 


Val 


Arg 


Thr 


Thr 


Phe 


Gly 


Ser 


Val 


He 


Leu 


Thr 


Asn 


His 






660 










665 










670 






Ser 


Met 


Pro 


Gly 


Ser 


Ser 


Gin 


Asn 


Gin 


Ser 


Gin 


Pro 


Gly Arg 


He 


Lys 






675 










680 










685 








Ala 


Trp 
690 


He 


Pro 


Gly 


Arg 


Tyr 
695 


Pro 


Ser 


Gly 


Gly 


Cys 
700 


Arg 


Val 


Leu 


Gly 


Val 


Thr 


Gly 


Thr 


Thr 


Asp 


Pro 


Ser 


Pro 


Leu 


Thr 


Asp 


Ser 


Ser 


His 


Trp 


705 








710 










715 










720 


Gly 


Asn 


Lys 


Ala 


He 

























725 



<210> 27 

<211> 7419 

<212> DNA 

<213> Homo Sapiens 
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<400> 27 

cggggaccga tccagcctcc ggactctagc ctaggctttt gcaaaaagct atttaggtga 60 

cactatagaa ggtacgcctg caggtaccgg tccggaattc ccgggtcgac ccacgcgtcc 120 

gcagccccgt cgccggcgga ggcgggcgcg ggcgcgtncc ctgtggccag tcacccggag 180 

gagttggtcg cacaattatg aaagactcgg ctrctgctgc tagcgccgga gctgagttag 240 

ttctgagaag gtttccctgg gcgttccttg tccggcggcc tctgctgccg cctccggaga 300 

cgcttcccga tagatggcta caggccgcgg aggaggagga ggtggagttg ctgcccttcc 360 

ggagtccgcc ccgtgaggag aatgtcccag aaatcctgga tagaaagcac tttgaccaag 420 

agggaatgtg tatatattat accaagttcc aaggaccctc acagatgcct tccaggatgt 480 

caaatttgtc agcaactcgt caggtgtttt tgtggtcgct tggtcaagca acatgcttgt 540 

tttactgcaa gtcttgccat gaaatactca gatgtgaaat tgggtgacca ttrtaatcag 600 

gcaatagaag aatggtctgt ggaaaagcat acagaacaga gcccaacgga tgcttatgga 660 

gtcataaatt ttcaaggggg ttctcattcc tacagagcta agtatgtgag gctatcatat 720 

gacaccaaac ctgaagtcat tctgcaactt ctgcttaaag aatggcaaat ggagttaccc 780 

aaacttgtta tctctgtaca tgggggcatg cagaaatttg agcttcaccc acgaatcaag 840 

cagttgcttg gaaaaggtct tattaaagct gcagttacaa ctggagcctg gattttaact 900 

ggaggagtaa acacaggtgt ggcaaaacat gttggagatg ccctcaaaga acatgcttcc 960 

agatcatctc gaaagatttg cactatcgga atagctccat ggggagtgat tgaaaacaga 1020 

aatgatcttg ttgggagaga tgtggttgct ccttatcaaa ccttattgaa ccccctgagc 108 0 

aaattgaatg ttttgaataa tctgcattcc catttcatat tggtggatga tggcactgtt 114 0 

ggaaagtatg gggcggaagt cagactgaga agagaacttg aaaaaactat taatcagcaa 1200 
agaattcatg ctaggattgg ccagggtgtc cctgtggtgg cacttatatt tgagggtggg • 1260 

ccaaatgtta tcctcacagt tcttgaatac cttcaggaaa gcccccctgt tccagtagtt 1320 

gtgtgtgaag gaacaggcag agctgcagat ctgctagcgt atattcataa acaaacagaa 1380 

gaaggaggga atcttcctga tgcagcagag cccgatatta tttccactat caaaaaaaca 14 4 0 

tttaactttg gccagaatga agcacttcat ttatttcaaa cactgatgga gtgcatgaaa 1500 

agaaaggagc ttatcactgt tttccatatt gggtcagatg aacatcaaga tatagatgta 1560 

gcaatactta ctgcactgct aaaaggtact aatgcatctg catttgacca gcttatcctt 1620 

acattggcat gggatagagt tgacattgcc aaaaatcatg tatttgttta tggacagcag 1680 

tggctggttg gatccttgga acaagctatg cttgatgctc ttgtaatgga tagagttgca 174 0 

tttgtaaaac ttcttattga aaatggagta agcatgcata aattccttac cattccgaga 1800 

ctggaagaac tttacaacac taaacaaggt ccaactaatc caatgctgtt tcatcttgtt 1860 

cgagacgtca aacagggaaa tcttcctcca ggatataaga tcactctgat tgatatagga 1920 

cttgttattg aatatctcat gggaggaacc tacagatgca cctatactag gaaacgtttt 1980 

cgattaatat ataatagtct tggtggaaat aatcggaggt ctggccgaaa tacctccagc 2040 

agcactcctc agttgcgaaa gagtcatgaa tcttttggca atagggcaga taaaaaggaa 2100 

aaaatgaggc ataaccattt cattaagaca gcacagccct tccgaccaaa gattgataca 2160 

gttatggaag aaggaaagaa gaaaagaacc aaagatgaaa ttgtagacat tgatgatcca 2220 

gaaaccaagc gctttcctta tccacttaat gaacttttaa tttgggcttg ccttatgaag 2280 

aggcaggtca tggcccgttt tttatggcaa catggtgaag aatcaatggc taaagcatta 2340 

gttgcctgta agatctatcg ttcaatggca tatgaagcaa agcagagtga cctggtagat 2400 

gatacttcag aagaactaaa acagtattcc aatgattttg gtcagttggc cgttgaatta 2460 

ttagaacagt ccttcagaca agatgaaacc atggctatga aattgctcac ttatgaactg 2520 

aagaactgga gtaattcaac ctgccttaag ttagcagttt cttcaagact tagacctttt 2580 

gtagctcaca cctgtacaca aatgttgtta tctgatatgt ggatgggaag gctgaatatg 2640 

aggaaaaatt cctggtacaa ggtcatacta agcattttag ttccacctgc catattgctg 2700 

ttagagtata aaactaaggc tgaaatgtcc catatcccac aatctcaaga tgctcatcag 2760 

atgacaatgg atgacagcga aaacaacttt cagaacataa cagaagagat ccccatggaa 2820 

gtgtttaaag aagtacggat tttggatagt aatgaaggaa agaatgagat ggagatacaa 2880 

atgaaatcaa aaaagcttcc aattacgcga aagttttatg ccttttatca tgcaccaatt 2 94 0 

gtaaaattct ggtttaacac gttggcatat ttaggatttc tgatgcttta tacatttgtg 3000 

gttcttgtac aaatggaaca gttaccttca gttcaagaat ggattgttat tgcttatatt 3060 

tttacttatg ccattgagaa agtccgtgag atctttatgt ctgaagctgg gaaagtaaac 3120 

cagaagatta aagtatggtt tagtgattac ttcaacatca gtgatacaat tgccataatt 3180 

tctttcttca ttggatttgg actaagattt ggagcaaaat ggaactttgc aaatgcatat 324 0 

gataatcatg tttttgtggc tggaagatta atttactgtc ttaacataat attttggtat 3300 

gtgcgtttgc tagattttct agctgtaaat caacaggcag gaccttatgt aatgatgatt 3360 

ggaaaaatgg tggccaatat gttctacatt gtagtgatta tggctcttgt attacttagt 3420 

tttggtgttc ccagaaaggc aatactttat cctcatgaag caccatcttg gactcttgct 3480 

aaagatatag tttttcaccc atactggatg atttttggtg aagtttatgc atacgaaatt 3540 
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gatgtgtgtg caaatgattc tgttatcccc caaatctgtg gtcctgggac gtggttgact 3600 

ccatttcttc aagcagtcta cctctttgta cagtatatca ttatggttaa tcttcttatt 3660 

gcatttttca acaatgtgta tttacaagtg aaggcaattt ccaatattgt atggaagtac 3720 

cagcgttatc attttattat ggcttatcat gagaaaccag ttctgcctcc tccacttatc 3780 

attcttagcc atatagtttc tctgttttgc tgcatatgta agagaagaaa gaaagataag 3840 

acttccgatg gaccaaaact tttcttaaca gaagaagatc aaaagaaact tcatgatttt 3900 

gaagagcagt gtgttgaaat gtatttcaat gaaaaagatg acaaatttca ttctgggagt 3960 

gaagagagaa ttcgtgtcac ttttgaaaga gtggaacaga tgtgcattca gattaaagaa 4020 

gttggagatc gtgtcaacta cataaaaaga tcattacaat cattagattc tcaaattggc 4080 

catttgcaag atctttcagc cctgacggta gai:acattaa aaacactcac tgcccagaaa 4140 

gcgtcggaag ctagcaaagt tcataatgaa atcacacgag aactgagcat ttccaaacac 4200 

ttggctcaaa acctrattga tgatggtcct gtaagacctt ctgtatggaa aaagcatggt 4260 

gttgtaaata cacttagctc ctctcttcct caaggtgatc ttgaaagtaa taatcctttt 4320 

cattgtaata ttttaatgaa agatgacaaa gacccccagt gtaatatatt tggtcaagac 4380 

ttacctgcag taccccagag aaaagaattt aartttccag aggctggttc ctcttctggt 4440 

gccttattcc caagtgctgt ttcccctcca gaactgcgac agagactaca tggggtagaa 4500 

ctcttaaaaa tatttaataa aaatcaaaaa ttaggcagtt catctactag cataccacat 4560 

ctgtcatccc caccaaccaa attttttgtt agtacaccat ctcagccaag ttgcaaaagc 4 620 

cacttggaaa ctggaaccaa agatcaagaa actgtttgct ctaaagctac agaaggagat 4 680 

aatacagaat ttggagcatt tgtaggacac agagatagca tggatttaca gaggtttaaa 4740 

gaaacatcaa acaagataaa aatactatcc aataacaata cttctgaaaa cactttgaaa 4800 

cgagtgagtt ctcttgctgg atttactgac tgtcacagaa cttccattcc tgttcattca 4860 

aaacaagcag aaaaaatcag tagaaggcca tcraccgaag acactcatga agtagattcc 4 920 

aaagcagctt taataccgga ttggttacaa gatagaccat caaacagaga aatgccatct 4 980 

gaagaaggaa cattaaatgg tctcacttct ccatttaagc cagctatgga tacaaattac 5040 

tattattcag ctgtggaaag aaataacttg atgaggttat cacagagcat tccatttaca 5100 

cctgtgcctc caagagggga gcctgtcaca gtgtatcgtt tggaagagag ttcacccaac 5160 

atactaaata acagcatgtc ttcttggtca caactaggcc tctgtgccaa aatagagttt 5220 

ttaagcaaag aggagatggg aggaggttta cgaagagctg tcaaagtaca gtgtacctgg 5280 

tcagaacatg atatcctcaa atcagggcat ctttatatta tcaaatcttt tcttccagag 5340 

gtggttaata catggtcaag tatttataaa gaagatacag ttctgcatct ctgtctgaga 5400 

gaaattcaac aacagagagc agcacaaaag cttacgtttg cctttaatca aatgaaaccc 54 60 

aaatccatac catattctcc aaggttcctt gaagttttcc tgctgtattg ccattcagca 5520 

ggacagtggt ttgctgtgga agaatgtatg actggagaat ttagaaaata caacaataat 5580 

aatggagatg agattattcc aactaatact ctggaagaga tcatgctagc ctttagccac 5640 

tggacttacg aatatacaag aggggagtta ctggtacttg atttgcaagg tgttggtgaa 5700 

aatttgactg acccatctgt gataaaagca gaagaaaaga gatcctgtga tatggttttt 5760 

ggcccagcaa atctaggaga agatgcaatt aaaaacttca gagcaaaaca tcactgtaat 5820 

tcttgctgta gaaagcttaa acttccagat ccgaagagga atgattatac gcctgataaa 5880 

attatatttc ctcaggatga gccttcagat ttgaatcttc agcctggaaa ttccaccaaa 5940 

gaatcagaat caactaattc tgttcgtctg atgttataat attaatatta ctgaatcatt 6000 

ggttttgcct gcacctcaca gaaatgttac tgtgtcactt ttccctcggg aggaaattgt 6060 

ttggtaatat agaaaggtgt atgcaagttg aatttgctga ctccagcaca gttaaaaggt 6120 

caatattctt ttgacctgat taatcagtca gaaagtccct ataggataga gctggcagct 6180 

gagaaatttt aaaggtaatt gataattagt atttgtaact ttttaaaggg ctctttgtat 6240 

agcagaggat ctcatttgac tttgttttga tgagggtgat gccctctctt atgtggtaca 6300 

ataccattaa ccaaaggtag gtgtccatgc agattttatt ggcagctgtt ttattgccat 6360 

tcaactaggg aaatgaagaa atcacgcagc cttttggtta aatggcagtc aaaattttcc 6420 

tcagtgtatt tagtgtgttc agtgatgata tcactggttc ccaactagat gcttgttggc 6480 

cacgggaagg gaaatgactt gttctaattc taggttcaca gaggtatgag aagcctgaac 6540 

tgaagaccat tttcaagagg gacggtattt atgaatcagg gttaggctcc atatttaaag 6600 

atagagccag tttttttttt aaatagaacc caaattgtgt aaaaatgtta attgggtttt 6660 

ttaaacattg ttttatcaag tcactgttaa gtagaagaaa gccatggtaa actgatacat 6720 

aacctaaatt ataaaagcag aaacctaact cactcgtcaa gggaagttac cttttgagga 67 80 

aagttaaagt acttttttcc ctatctgtat ctatagcaac aacccagaac ttacaaactt 6840 

ctccaaagat tttattgatt gttatatcaa atcagaatgt aaacatgaac tcttgcatat 6900 

atttaaaatt gtgttggaac atttgaacat gaatgctgtt tgggtactta agaaattrat 6960 

tcagtnggat tatcattatg tganactggc agattgcagt gcanccttat gccaataaaa 7 020 

tgtaatttar cagccccaga tattgttgaa tattcaacaa taacaagaaa agcttttcat 7080 

ctaagtttta tgctttaatt ttttttcttt ttttttcttt ttcttttgtt tccttggtac 7140 
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taattttaat ttttatttgg aagggagcag tataaagctt atttgtattt agtagtgtat 7200 

ctcatagata cagacaaggc aagagatgat aagctgttta aatagtgktt aatattgatt 7260 

gggggtgggg agaaagaaaa agtgtattac ttaaagatac tatatacskt ttktatatca 7320 

ttaaatcttt aaaagaaatn naataaattt attgtttnca aaaaaaaaac ccnntaaaaa 7380 

aaaaagggcg gcccctctag aggatccctc gaggggccc 7419 



<210> 28 
<211> 1865 
<212> PRT 

<213> Homo Sapiens 



<400> 28 
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Val 
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Tyr His Ala Pro lie Val Lys Phe Trp Phe Asn Thr Leu Ala Tyr Leu 

850 855 860 

Gly Phe Leu Met Leu Tyr Thr Phe Val Val Leu Val Gin Met Glu Gin 
865 870 875 880 

Leu Pro Ser Val Gin Glu Trp lie Val lie Ala Tyr lie Phe Thr Tyr 

885 890 895 

Ala He Glu Lys Val Arg Glu He Phe Met Ser Glu Ala Gly Lys Val 

900 905 910 

Asn Gin Lys He Lys Val Trp Phe Ser Asp Tyr Phe Asn He Ser Asp 

915 920 925 

Thr He Ala He He Ser Phe Phe He Gly Phe Gly Leu Arg Phe Gly 

930 935 940 

Ala Lys Trp Asn Phe Ala Asn Ala Tyr Asp Asn His Val Phe Val Ala 
945 950 955 960 

Gly Arg Leu He Tyr Cys Leu Asn He He Phe Trp Tyr Val Arg Leu 

965 970 975 

Leu Asp Phe Leu Ala Val Asn Gin Gin Ala Gly Pro Tyr Val Met Met 

980 985 990 

He Gly Lys Met Val Ala Asn Met Phe Tyr He Val Val He Met Ala 

995 1000 1005 

Leu Val Leu Leu Ser Phe Gly Val Pro Arg Lys Ala He Leu Tyr Pro 

1010 1015 1020 

His Glu Ala Pro Ser Trp Thr Leu Ala Lys Asp He Val Phe His Pro 
1025 1030 1035 1040 

Tyr Trp Met He Phe Gly Glu Val Tyr Ala Tyr Glu He Asp Val Cys 

1045 1050 1055 

Ala Asn Asp Ser Val He Pro Gin He Cys Gly Pro Gly Thr Trp Leu 

1060 1065 1070 

Thr Pro Phe Leu Gin Ala Val Tyr Leu Phe Val Gin Tyr He He Met 

1075 1080 1085 

Val Asn Leu Leu He Ala Phe Phe Asn Asn Val Tyr Leu Gin Val Lys 

1090 1095 1100 

Ala He Ser Asn He Val Trp Lys Tyr Gin Arg Tyr His Phe He Met 
1105 1110 1115 1120 

Ala Tyr His Glu Lys Pro Val Leu Pro Pro Pro Leu He He Leu Ser 

1125 1130 1135 

His He Val Ser Leu Phe Cys Cys He Cys Lys Arg Arg Lys Lys Asp 

1140 1145 1150 

Lys Thr Ser Asp Gly Pro Lys Leu Phe Leu Thr Glu Glu Asp Gin Lys 

1155 1160 1165 

Lys Leu His Asp Phe Glu Glu Gin Cys Val Glu Met Tyr Phe Asn Glu 

1170 1175 1180 

Lys Asp Asp Lys Phe His Ser Gly Ser Glu Glu Arg He Arg Val Thr 
1185 1190 1195 1200 

Phe Glu Arg Val Glu Gin Met Cys He Gin He Lys Glu Val Gly Asp 

1205 1210 1215 

Arg Val Asn Tyr He Lys Arg Ser Leu Gin Ser Leu Asp Ser Gin He 

1220 1225 1230 

Gly His Leu Gin Asp Leu Ser Ala Leu Thr Val Asp Thr Leu Lys Thr 

1235 1240 1245 

Leu Thr Ala Gin Lys Ala Ser Glu Ala Ser Lys Val His Asn Glu He 

1250 1255 1260 

Thr Arg Glu Leu Ser He Ser Lys His Leu Ala Gin Asn Leu He Asp 
1265 1270 1275 1280 

Asp Gly Pro Val Arg Pro Ser Val Trp Lys Lys His Gly Val Val Asn 

1285 1290 1295 

Thr Leu Ser Ser Ser Leu Pro Gin Gly Asp Leu Glu Ser Asn Asn Pro 

1300 1305 1310 

Phe His Cys Asn He Leu Met Lys Asp Asp Lys Asp Pro Gin Cys Asn 
1315 1320 1325 
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Ile Phe Gly Gin Asp Leu Pro Ala Val Pro Gin Arg Lys Glu Phe Asn 

1330 1335 1340 

Phe Pro Glu Ala Gly Ser Ser Ser Gly Ala Leu Phe Pro Ser Ala Val 
1345 1350 1355 1360 

Ser Pro Pro Glu Leu Arg Gin Arg Leu His Gly Val Glu Leu Leu Lys 

1365 1370 1375 

lie Phe Asn Lys Asn Gin Lys Leu Gly Ser Ser Ser Thr Ser lie Pro 

1380 1385 1390 

His Leu Ser Ser Pro Pro Thr Lys Phe Phe Val Ser Thr Pro Ser Gin 

1395 1400 1405 

Pro Ser Cys Lys Ser His Leu Glu Thr Gly Thr Lys Asp Gin Glu Thr 

1410 1415 1420 

Val Cys Ser Lys Ala Thr Glu Gly Asp Asn Thr Glu Phe Gly Ala Phe 
1425 1430 1435 1440 

Val Gly His Arg Asp Ser Met Asp Leu Gin Arg Phe Lys Glu Thr Ser 

1445 1450 1455 

Asn Lys lie Lys lie Leu Ser Asn Asn Asn Thr Ser Glu Asn Thr Leu 

1460 1465 1470 

Lys Arg Val Ser Ser Leu Ala Gly Phe Thr Asp Cys His Arg Thr Ser 

1475 1480 1485 

lie Pro Val His Ser Lys Gin Ala Glu Lys lie Ser Arg Arg Pro Ser 

1490 1495 1500 

Thr Glu Asp Thr His Glu Val Asp Ser Lys Ala Ala Leu He Pro Asp 
1505 1510 1515 1520 

Trp Leu Gin Asp Arg Pro Ser Asn Arg Glu Met Pro Ser Glu Glu Gly 

1525 1530 1535 

Thr Leu Asn Gly Leu Thr Ser Pro Phe Lys Pro Ala Met Asp Thr Asn 

1540 1545 1550 

Tyr Tyr Tyr Ser Ala Val Glu Arg Asn Asn Leu Met Arg Leu Ser Gin 

1555 1560 1565 

Ser He Pro Phe Thr Pro Val Pro Pro Arg Gly Glu Pro Val Thr Val 

1570 1575 1580 

Tyr Arg Leu Glu Glu Ser Ser Pro Asn He Leu Asn Asn Ser Met Ser 
1585 1590 1595 1600 

Ser Trp Ser Gin Leu Gly Leu Cys Ala Lys He Glu Phe Leu Ser Lys 

1605 1610 1615 

Glu Glu Met Gly Gly Gly Leu Arg Arg Ala Val Lys Val Gin Cys Thr 

1620 1625 1630 

Trp Ser Glu His Asp He Leu Lys Ser Gly His Leu Tyr He He Lys 

1635 1640 1645 

Ser Phe Leu Pro Glu Val Val Asn Thr Trp Ser Ser He Tyr Lys Glu 

1650 1655 1660 

Asp Thr Val Leu His Leu Cys Leu Arg Glu He Gin Gin Gin Arg Ala 
1665 1670 1675 1680 

Ala Gin Lys Leu Thr Phe Ala Phe Asn Gin Met Lys Pro Lys Ser He 

1685 1690 1695 

Pro Tyr Ser Pro Arg Phe Leu Glu Val Phe Leu Leu Tyr Cys His Ser 

1700 1705 1710 

Ala Gly Gin Trp Phe Ala Val Glu Glu Cys Met Thr Gly Glu Phe Arg 

1715 1720 1725 

Lys Tyr Asn Asn Asn Asn Gly Asp Glu He He Pro Thr Asn Thr Leu 

1730 1735 1740 

Glu Glu He Met Leu Ala Phe Ser His Trp Thr Tyr Glu Tyr Thr Arg 
1745 1750 1755 1760 

Gly Glu Leu Leu Val Leu Asp Leu Gin Gly Val Gly Glu Asn Leu Thr 

1765 1770 1775 

Asp Pro Ser Val He Lys Ala Glu Glu Lys Arg Ser Cys Asp Met Val 

1780 1785 1790 

Phe Gly Pro Ala Asn Leu Gly Glu Asp Ala He Lys Asn Phe Arg Ala 

1795 1800 1805 
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Lys His His Cys Asn Ser Cys Cys Arg Lys Leu Lys Leu Pro Asp Leu 

1810 1815 1820 

Lys Arg Asn Asp Tyr Thr Pro Asp Lys lie lie Phe Pro Gin Asp Glu 
1825 1830 1835 1840 

Pro Ser Asp Leu Asn Leu Gin Pro Gly Asn Ser Thr Lys Glu Ser Glu 

1845 1850 1855 

Ser Thr Asn Ser Val Arg Leu Met Leu 
1860 1865 

<210> 29 

<211> 4061 

<212> DNA 

<213> Homo Sapiens 



<400> 29 

ggtctggaag cagagccggc ggagggagcg ccggggccct gggctgcagg aggttgcggc 
ggccgcggca gcatggtggt gccggagaag gagcagagct ggatccccaa gatcttcaag 
aagaagacct gcacgacgtt catagttgac tccacagatc cgggagggac cttgtgccag 
tgtgggcgcc cccggaccgc ccaccccgca gtggccatgg aggatgcctt cggggcagcc 
gtggtgaccg tgtgggacag cgatgcacac accacggaga agcccaccga tgcctacgga 
gagctggact tcacgggggc cggccgcaag cacagcaatt tcctccggct ctctgaccga 
acggatccag ctgcagttta tagtctggtc acacgcacat ggggcttccg tgccccgaac 
ctggtggtgt cagtgctggg gggatcgggg ggccccgtcc tccagacctg gctgcaggac 
ctgctgcgtc gtgggctggt gcgggctgcc cagagcacag gagcctggat tgtcactggg 
ggtctgcaca cgggcatcgg ccggcatgtt ggtgtggctg tacgggacca tcagatggcc 
agcactgggg gcaccaaggt ggtggccatg ggtgtggccc cctggggtgt ggtccggaat 
agagacaccc tcatcaaccc caagggctcg ttccctgcga ggtaccggtg gcgcggtgac 
ccggaggacg gggtccagtt tcccctggac tacaactact cggccttctt cctggtggac 
gacggcacac acggctgcct ggggggcgag aaccgcttcc gcttgcgcct ggagtcctac 
atctcacagc agaagacggg cgtgggaggg actggaattg acatccctgt cctgctcctc 
ctgattgatg gtgatgagaa gatgttgacg cgaatagaga acgccaccca ggctcagctc 
ccatgtctcc tcgtggctgg ctcaggggga gctgcggact gcctggcgga gaccctggaa 
gacactctgg ccccagggag tgggggagcc aggcaaggcg aagcccgaga tcgaatcagg 
cgtttctttc ccaaagggga ccttgaggtc ctgcaggccc aggtggagag gattatgacc 
cggaaggagc tcctgacagt ctattcttct gaggatgggt ctgaggaatt cgagaccata 
gttttgaagg cccttgtgaa ggcctgtggg agctcggagg cctcagccta cctggatgag 
ctgcgtttgg ctgtggcttg gaaccgcgtg gacattgccc agagtgaact ctttcggggg 
gacatccaat ggcggtcctt ccatctcgaa gcttccctca tggacgccct gctgaatgac 
cggcctgagt tcgtgcgctt gctcatttcc cacggcctca gcctgggcca cttcctgacc 
ccgatgcgcc tggcccaact ctacagcgcg gcgccctcca actcgctcat ccgcaacctt 
ttggaccagg cgtcccacag cgcaggcacc aaagccccag ccctaaaagg gggagctgcg 
gagctccggc cccctgacgt ggggcatgtg ctgaggatgc tgctggggaa gatgtgcgcg 
ccgaggtacc cctccggggg cgcctgggac cctcacccag gccagggctt cggggagagc 
atgtatctgc tctcggacaa ggccacctcg ccgctctcgc tggatgctgg cctcgggcag 
gccccctgga gcgacctgct tctttgggca ctgttgctga acagggcaca gatggccatg 
tacttctggg agatgggttc caatgcagtt tcctcagctc ttggggcctg tttgctgctc 
cgggtgatgg cacgcctgga gcctgacgct gaggaggcag cacggaggaa agacctggcg 
ttcaagtttg aggggatggg cgttgacctc tttggcgagt gctatcgcag cagtgaggtg 
agggctgccc gcctcctcct ccgtcgctgc ccgctctggg gggatgccac ttgcctccag 
ctggccatgc aagctgacgc ccgtgccttc tttgcccagg atggggtaca gtctctgctg 
acacagaagt ggtggggaga tatggccagc actacaccca tctgggccct ggttctcgcc 
ttcttttgcc ctccactcat ctacacccgc ctcatcacct tcaggaaatc agaagaggag 
cccacacggg aggagctaga gtttgacatg gatagtgtca ttaatgggga agggcctgtc 
gggacggcgg acccagccga gaagacgccg ctgggggtcc cgcgccagtc gggccgtccg 
ggttgctgcg ggggccgctg cggggggcgc cggtgcctac gccgctggtt ccacttctgg 
ggcgcgccgg tgaccatctt catgggcaac gtggtcagct acctgctgtt cctgctgctt 
ttctcgcggg tgctgctcgt ggatttccag ccggcgccgc ccggctccct ggagctgctg 
ctctatttct gggctttcac gctgctgtgc gaggaactgc gccagggcct gagcggaggc 
gggggcagcc tcgccagcgg gggccccggg cctggccatg cctcactgag ccagcgcctg 
cgcctctacc tcgccgacag ctggaaccag tgcgacctag tggctctcac ctgcttcctc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
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ctgggcgtgg 
atcgacttca 
gggcccaaga 
ctcggcgtgt 
agtgacttcc 
cagattcccc 
cccggcttct 
aactggctgg 
aacttgctca 
tactggaagg 
ccgcccttta 
cggagccccc 
gccgagcgga 
gctagggaca 
ttggcactga 
cgggaggtcc 
gccttgctgc 
ctgctggcgg 
tctgggcctc 
gggccactgt 
cctcccagaa 
ctggaggctg 
cctcacactg 



gctgccggct 
tggttttcac 
tcgtcatcgt 
ggctggtagc 
caagtatcct 
aggaggacat 
gggcacaccc 
tggtgctgct 
ttgccatgtt 
cgcagcgtta 
tcgtcatctc 
agccgtcctc 
agctgctaac 
agcgggagag 
aacagctggg 
agcagtgtag 
ccccaggtgg 
acttcaagga 
ggcccccgca 
caggaccacc 
ccagtcccag 
cagggtcctt 
gggaaataaa 



gaccccgggt 
ggtgcggctg 
gagcaagatg 
ctatggcgtg 
gcgccgcgtc 
ggacgtggcc 
tcctggggcc 
cctcgtcatc 
cagttacaca 
ccgcctcatc 
ccacttgcgc 
cccggccctc 
gtgggaatcg 
cgactccgag 
acacatccgc 
ccgcgtcctg 
gccgccaccc 
gaagccccca 
cctggtggcc 
tttgggagtg 
cctgggagga 
ggggtaacag 
gccatttcag 
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ttgtaccacc 
crtcacatct 
atgaaggacg 
gccacggagg 
ttctaccgtc 
ctcatggagc 
caggcgggca 
ttcctgctcg 
ttcggcaaag 
cgggaattcc 
ctcctgctca 
gagcatttcc 
gtgcataagg 
cgtctgaagc 
gagtacgaac 
gggtgggtgg 
cctgacctgc 
caggggattt 
ttgtccttga 
tcatccttac 
tcaaggcctg 
ggaccacaga 
aggaaaaaaa 



tgggccgcac 
tcacggtcaa 
tgttcttctt 
ggctcctgag 
cctacctgca 
acagcaactg 
cctgcgtctc 
tggccaacat 
tacagggcaa 
actctcggcc 
ggcaattgtg 
gggtttacct 
agaactttct 
gcacgtccca 
agcgcctgaa 
ccgaggccct 
ctgggtccaa 
tgctcctaga 
ggtgagcccc 
aaaccacagc 
gatcccgggc 
cccctcacca 
a 



tgtcctctgc 
caaacagctg 
cctcttcttc 
gccacgggac 
gatcttcggg 
ctcgtcggag 
ccagtatgcc 
cctgctggtc 
cagcgatctc 
cgcgctggcc 
caggcgaccc 
ttctaaggaa 
gctggcacgc 
gaaggtggac 
agtgctggag 
gagccgctct 
agactgagcc 
gtaaggctca 
atgtccatct 
atgcccggct 
cgttatccat 
ctcacagatt 



2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4061 





<210> 


30 




























<211> 


1214 


























<212> 


PRT 




























<213> 


Homo Sapiens 






















<400> 


30 


























Met 


Val 


Val 


Pro 


Glu 


Lys 


Glu 


Gin 


Ser 


Trp 


He 


Pro 


Lys 


He 


Phe 


Lys 


1 








5 










10 










15 




Lys 


Lys 


Thr 


Cys 


Thr 


Thr 


Phe 


He 


Val 


Asp 


Ser 


Thr 


Asp 


Pro 


Gly Gly 




20 










25 










30 






Thr 


Leu 


Cys 


Gin 


Cys 


Gly 


Arg 


Pro 


Arg 


Thr 


Ala 


His 


Pro 


Ala 


Val 


Ala 






35 










40 










45 








Met 


Glu 


Asp 


Ala 


Phe 


Gly 


Ala 


Ala 


Val 


Val 


Thr 


Val 


Trp 


Asp 


Ser 


Asp 




50 








55 










60 










Ala 


His 


Thr 


Thr 


Glu 


Lys 


Pro 


Thr 


Asp 


Ala 


Tyr 


Gly 


Glu 


Leu 


Asp 


Phe 


65 










70 










75 










80 


Thr 


Gly 


Ala 


Gly 


Arg 


Lys 


His 


Ser 


Asn 


Phe 


Leu 


Arg 


Leu 


Ser 


Asp 


Arg 






85 










90 










95 




Thr 


Asp 


Pro 


Ala 


Ala 


Val 


Tyr 


Ser 


Leu 


Val 


Thr 


Arg 


Thr 


Trp 


Gly 


Phe 






100 










105 










110 






Arg 


Ala 


Pro 


Asn 


Leu 


Val 


Val 


Ser 


Val 


Leu 


Gly 


Gly 


Ser 


Gly 


Gly 


Pro 




115 










120 










125 








Val 


Leu 


Gin 


Thr 


Trp 


Leu 


Gin 


Asp 


Leu 


Leu 


Arg 


Arg 


Gly 


Leu 


Val 


Arg 




130 










135 










140 










Ala 


Ala 


Gin 


Ser 


Thr 


Gly 


Ala 


Trp 


He 


Val 


Thr 


Gly 


Gly 


Leu 


His 


Thr 


145 










150 










155 










160 


Gly 


He 


Gly 


Arg 


His 


Val 


Gly 


Val 


Ala 


Val 


Arg 


Asp 


His 


Gin 


Met 


Ala 






165 










170 










175 




Ser 


Thr 


Gly 


Gly 


Thr 


Lys 


Val 


Val 


Ala 


Met 


Gly 


Val 


Ala 


Pro 


Trp 


Gly 






180 










185 










190 






Val 


Val 


Arg 


Asn 


Arg 


Asp 


Thr 


Leu 


He 


Asn 


Pro 


Lys 


Gly 


Ser 


Phe 


Pro 






195 










200 










205 








Ala 


Arg 


Tyr 


Arg 


Trp 


Arg 


Gly 


Asp 


Pro 


Glu 


Asp 


Gly 


Val 


Gin 


Phe 


Pro 




210 










215 










220 










Leu 


Asp 


Tyr 


Asn 


Tyr 


Ser 


Ala 


Phe 


Phe 


Leu 


Val 


Asp 


Asp 


Gly 


Thr 


His 


225 






230 










235 










240 
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Gly Cys Leu Gly 

lie Ser Gin Gin 
260 

Val Leu Leu Leu 
275 

Giu Asn Ala Thr 
290 

Gly Gly Ala Ala 
305 

Pro Gly Ser Gly 

Arg Phe Phe Pro 
340 

Arg lie Met Thr 
355 

Gly Ser Glu Glu 
370 

Cys Gly Ser Ser 
385 

Val Ala Trp Asn 

Asp lie Gin Trp 
420 

Leu Leu Asn Asp 
435 

Leu Ser Leu Gly 
450 

Ser Ala Ala Pro 
465 

Ser His Ser Ala 

Glu Leu Arg Pro 
500 

Lys Met Cys Ala 
515 

Pro Gly Gin Gly 
530 

Thr Ser Pro Leu 
545 



Asp 


Leu 


Leu 


Leu 


Tyr 


Phe 


Trp 


Glu 








580 


Cys 


Leu 


Leu 


Leu 






595 




Ala 


Ala 


Arg 


Arg 




610 






Asp 


Leu 


Phe 


Gly 


625 








Leu 


Leu 


Leu 


Arg 


Leu 


Ala 


Met 


Gin 








660 


Gin 


Ser 


Leu 


Leu 






675 




Pro 


He 


Trp 


Ala 




690 






Thr 


Arg 


Leu 


He 


705 









Gly Glu Asn Arg 
245 

Lys Thr Gly Val 

Leu He Asp Gly 
280 

Gin Ala Gin Leu 
295 

Asp Cys Leu Ala 
310 

Gly Ala Arg Gin 

325 

Lys Gly Asp Leu 

Arg Lys Glu Leu 
360 

Phe Glu Thr He 
375 

Glu Ala Ser Ala 
390 

Arg Val Asp He 
405 

Arg Ser Phe His 

Arg Pro Glu Phe 
440 

His Phe Leu Thr 
455 

Ser Asn Ser Leu 
470 

Gly Thr Lys Ala 
485 

Pro Asp Val Gly 

Pro Arg Tyr Pro 
520 

Phe Gly Glu Ser 
535 

Ser Leu Asp Ala 
550 

Trp Ala Leu Leu 
565 

Met Gly Ser Asn 

Arg Val Met Ala 
600 

Lys Asp Leu Ala 
615 

Glu Cys Tyr Arg 
630 

Arg Cys Pro Leu 
645 

Ala Asp Ala Arg 

Thr Gin Lys Trp 
680 

Leu Val Leu Ala 
695 

Thr Phe Arg Lys 
710 
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Phe 


Arg 


Leu 


Arg 




250 






Gly 


Gly 


Thr 


Gly 


265 








Asp 


Glu 


Lys 


Met 


Pro 


Cys 


Leu 


Leu 








300 


Glu 


Thr 


Leu 


Glu 






315 




Gly 


Glu 


Ala 


Arg 




330 






Glu 


Val 


Leu 


Gin 


345 








Leu 


Thr 


Val 


Tyr 


Val 


Leu 


Lys 


Ala 








380 


Tyr 


Leu 


Asp 


Glu 






395 




Ala 


Gin 


Ser 


Glu 




410 






Leu 


Glu 


Ala 


Ser 


425 








Val 


Arg 


Leu 


Leu 


Pro 


Met 


Arg 


Leu 








4 60 


He 


Arg 


Asn 


Leu 






475 




Pro 


Ala 


Leu 


Lys 




490 






His 


Val 


Leu 


Arg 


505 








Ser 


Gly 


Gly 


Ala 


Met 


Tyr 


Leu 


Leu 








540 


Gly 


Leu 


Gly 


Gin 






555 




Leu 


Asn 


Arg 


Ala 




570 






Ala 


Val 


Ser 


Ser 


585 








Arg 


Leu 


Glu 


Pro 


Phe 


Lys 


Phe 


Glu 








620 


Ser 


Ser 


Glu 


Val 






635 




Trp 


Gly 


Asp 


Ala 




650 






Ala" 


Phe 


Phe 


Ala 


665 








Trp 


Gly Asp 


Met 


Phe 


Phe 


Cys 


Pro 








700 


Ser 


Glu 


Glu 


Glu 



715 



Leu Glu Ser Tyr 
255 

He Asp He Pro 
270 

Leu Thr Arg He 
285 

Val Ala Gly Ser 

Asp Thr Leu Ala 
320 

Asp Arg He Arg 
335 

Ala Gin Val Glu 
350 

Ser Ser Glu Asp 
365 

Leu Val Lys Ala 

Leu Arg Leu Ala 
400 

Leu Phe Arg Gly 
415 

Leu Met Asp Ala 
430 

He Ser His Gly 
445 

Ala Gin Leu Tyr 

Leu Asp Gin Ala 
480 

Gly Gly Ala Ala 
495 

Met Leu Leu Gly 
510 

Trp Asp Pro His 

525 

Ser Asp Lys Ala 

Ala Pro Trp Ser 
560 

Gin Met Ala Met 
575 

Ala Leu Gly Ala 
590 

Asp Ala Glu Glu 
605 

Gly Met Gly Val 

Arg Ala Ala Arg 
640 

Thr Cys Leu Gin 
655 

Gin Asp Gly Val 
670 

Ala Ser Thr Thr 
685 

Pro Leu He Tyr 

Pro Thr Arg Glu 
720 
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Glu 


Leu 


Glu 


Phe 


Asp 
725 


Met 


Asp 


Ser 


Val 


He 
730 


Asn 


Gly 


Glu 


Gly 


Pro 
735 


Val 


Glv 


Thr 


Ala 


Asp 


Pro 


Ala 


Glu 


Lys 


Thr 


Pro 


Leu 


Gly 


Val 


Pro 


Arg 


Gin 






740 










745 










750 






Ser 


Glv 


Arq 


Pro 


Gly 


Cys 


Cys 


Gly Gly 


Arg 


Cys 


Gly 


Gly Arg 


Arg 


Cys 




755 










760 










765 








Leu 


770 


Ara 


Tro 


Phe 


His 


Phe 
775 


Tro 


Glv 


Ala 


Pro 


Val 
780 


Thr 


He 


Phe 


Met 


Glv 


Asn 


Val 


Val 


Ser 


Tyr 


Leu 


Leu 


Phe 


Leu 


Leu 


Leu 


Phe 


Ser 


Arg 


Val 


785 










790 










795 










800 


Leu 


Leu 


Val 


Asp 


Phe 


Gin 


Pro 


Ala 


Pro 


Pro 


Gly 


Ser 


Leu 


Glu 


Leu 


Leu 








805 










810 










815 




Leu 


Tvr 


Phe 


Tro 


Ala 


Phe 


Thr 


Leu 


Leu 


Cys 


Glu 


Glu 


Leu 


Arg 


Gin 


Gly 






820 










825 










830 






Leu 


Ser 


Gly 


Glv 


Glv 


Glv 

VI J. jr 


Ser 


Leu 


Ala 


Ser Gly Gly 


Pro 


Glv 
oxy 


Pro 


Gly 






835 










O fi u 


















His 


Ala 
850 


Ser 


Leu 


Ser 


Gin 


Arg 
855 


Leu 


Arg 


Leu 


Tyr 


Leu 
860 


Ala 


Asp 


Ser 


Trp 


Asn 


Gin 


Cys 


Asp 


Leu 


Val 


Ala 


Leu 


Thr 


Cys 


Phe 


Leu 


Leu 


Gly 


Val 


Gly 


865 






870 










875 










880 


Cys 


ArQ 


Leu 


Thr 


Pro 


Glv 


Leu 


Tyr 


His 


Leu 


Gly Arg 


Thr 


Val 


Leu 


Cys 






885 










890 










895 






Asp 


Phe 


Met 


Val 


Phe 


Thr 


Val 


Arg 


Leu 


Leu 


His 


He 


Phe 


Thr 


Val 






900 










905 










910 








Lys 


Gin 


Leu 


Glv 


Pro 


Lvs 


He 


Val 


He 


Val 


Ser 


Lys 


Met 


Met 


Lys 




915 










920 










925 








Asp 


Val 


Phe 


Phe 


Phe 


Leu 


Phe 


Phe 


Leu 


Gly Val 


Trp 


Leu 


Val 


Ala 


Tyr 


930 










935 










940 










Gly 


Val 


Ala 


Thr 


Glu 


Gly 


Leu 


Leu 


Arg 


Pro 


Arg 


Asp 


Ser 


Asp 


Phe 


Pro 


945 










950 










955 










960 


Ser 


He 


Leu 


Arg 


Arg 


Val 


Phe 


Tyr 


Arg 


Pro 


Tyr 


Leu 


Gin 


He 


Phe 


Gly 








965 










970 










975 




Gin 


He 


Pro 


Gin 
980 


Glu 


Asp 


Met 


Asp 


Val 
985 


Ala 


Leu 


Met 


Glu 


His 
990 


Ser 


Asn 


Cys 


Ser 


Ser 


Glu 


Pro 


Gly 


Phe 


Trp 


Ala 


His 


Pro 


Pro 


Gly Ala 


Gin 


Ala 




995 










1000 








1005 






Gly 


Thr 


Cys 


Val 


Ser 


Gin 


Tyr 


Ala 


Asn 


Trp 


Leu 


Val 


Val 


Leu 


Leu 


Leu 


1010 








1015 








1020 








Val 


He 


Phe 


Leu 


Leu 


Val 


Ala 


Asn 


He 


Leu 


Leu 


Val 


Asn 


Leu 


Leu 


He 


1025 








1030 








1035 








1040 


Ala 


Met 


Phe 


Ser 


Tyr 


Thr 


Phe 


Gly 


Lys 


Val 


Gin Gly 


Asn 


Ser 


Asp 


Leu 










1045 








1050 








1055 


Tyr 


Trp 


Lys 


Ala 


Gin 


Arg 


Tyr 


Arg 


Leu 


He 


Arg 


Glu 


Phe 


His 


Ser 


Arg 




1060 








1065 








1070 




Pro 


Ala 


Leu 


Ala 


Pro 


Pro 


Phe 


He 


Val 


He 


Ser 


His 


Leu 


Arg 


Leu 


Leu 






1075 








1080 








1085 






Leu 


Arg 


Gin 


Leu 


Cys 


Arg 


Arg 


Pro 


Arg 


Ser 


Pro 


Gin 


Pro 


Ser 


Ser 


Pro 




1090 








1095 








1100 








Ala 


Leu 


Glu 


His 


Phe 


Arg 


Val 


Tyr 


Leu 


Ser 


Lys 


Glu 


Ala 


Glu 


Arg 


Lys 


1105 








1110 








1115 








1120 


Leu 


Leu 


Thr 


Trp 


Glu 


Ser 


Val 


His 


Lys 


Glu 


Asn 


Phe 


Leu 


Leu 


Ala 


Arg 








1125 








1130 








1135 


Ala 


Arg 


Asp 


Lys 


Arg 


Glu 


Ser 


Asp 


Ser 


Glu 


Arg 


Leu 


Lys 


Arg 


Thr 


Ser 








1140 








1145 








1150 




Gin 


Lys 


Val 


Asp 


Leu 


Ala 


Leu 


Lys 


Gin 


Leu 


Gly 


His 


lie 


Arg 


Glu 


Tyr 




1155 








1160 








1165 






Glu 


Gin 


Arg 


Leu 


Lys 


Val 


Leu 


Glu 


Arg 


Glu 


Val 


Gin 


Gin 


Cys 


Ser 


Arg 




1170 








1175 








1180 








Val 


Leu 


Gly 


Trp 


Val 


Ala 


Glu 


Ala 


Leu 


Ser 


Arg 


Ser 


Ala 


Leu 


Leu 


Pro 


1185 




1190 








1195 








1200 
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Pro Gly Gly Pro Pro Pro Pro Asp Leu Pro Gly Ser Lys Asp 
1205 1210 

<210> 31 

<211> 4646 

<212> DNA 

<213> Homo Sapiens 



<400> 
tcgacccacg 
tccgcccacg 
agatgtcctt 
gcacccggac 
acttggtgaa 
attccaaggc 
gcacccagat 
ccgacgcctt 
tgtcctgcga 
aaacacccaa 
gcatgcgcaa 
tcacgggagg 
ccatcagcag 
tctccaaccg 
accttatgga 
tgctgctcgt 
agctagagaa 
ccattgtgtg 
tcaaaaataa 
ctagcctggt 
gctttttacc 
ggctcaaaga 
gggatgaaat 
agcaagacaa 
acttagccaa 
aagtcatgtt 
atggcttgaa 
acttcagcac 
tcctcacgtt 
gaaatggccg 
ccctgcaagc 
tttgggagca 
ctctggccaa 
agtacgagac 
cagaacagct 
cggtggaggc 
agcaatggta 
ttattatacc 
acaagaagct 
ggaatgtggt 
tccattcggt 
gtgatgaagt 
tgatggacac 
ctaataaaag 
ctctaagatt 
tgcagaggat 
cctttggcgt 
tccgttcggt 
atggtaccac 
gtgtggagct 
tgtgcatcta 



31 

cgtccgccca 
cgtccggggt 
tcgggcagcc 
cctgtactcc 
ttttattcaa 
cacggagaat 
caaccaaagt 
tggggatatt 
cacggacgcg 
cctggtcatt 
gatcttcagc 
cacccattat 
gagttcagag 
ggacaccctc 
tgacttcaca 
ggacaatggc 
gtatatctct 
ttttgcccaa 
aattccttgt 
ggaggtggag 
ccgcacggtg 
aattctcgaa 
tgtgagcaat 
ggataactgg 
tgatgagatt 
tacggctctc 
cctacggaag 
gcttgtgtac 
tgtctggaaa 
ggacgagatg 
tctcttcatc 
gaccaggggc 
agtgaagaac 
ccgggctgtt 
gctggtctat 
cacagaccag 
tggagagatt 
cttggtgggc 
gctttggtac 
cttctacatc 
gccacacccc 
gagacagtgg 
gctggggctt 
ctctttgtat 
gatccacatt 
gctgatcgat 
ggccaggcaa 
catctacgag 
gtatgacttt 
ggatgagcac 
catgttatcc 



cgcgtccgcc 
gaaagmramy 
aggctcagca 
agcgcgtctc 
gcaaatttta 
gtgtgcaagt 
gagaaatgga 
cagtttgaga 
gaaatccttt 
tctgtgaccg 
cggctcatct 
ggcctgatga 
gagaatattg 
atcaggaatt 
agagatccac 
tgtcatggac 
gagcgcacta 
ggaggtggaa 
gtggtggtgg 
gatgccctga 
tcccggctgc 
tgttctcacc 
gccatctcct 
aatgggcagc 
ttcaccaatg 
ataaaggaca 
tttctcaccc 
cggaatctgc 
ctggttgcga 
gacatagaac 
tgggccattc 
tgcactctgg 
gacatcaatg 
gagctgttca 
tcctgtgaag 
catttcatcg 
tcccgagaca 
tgtggctttg 
tatgtggcgt 
gccttcctcc 
cccgagctgg 
tacgtaaatg 
ttttacttca 
tctggacgag 
tttactgtaa 
gtgttcttct 
gggatcctta 
ccctacctgg 
gcccactgca 
aacctgcccc 
accaacatcc 



cacgcgtccg 
cmygcktsms 
tgaggaacag 
ggagcacaga 
agaaacgaga 
gtggctatgc 
actacaagaa 
cactggggaa 
acgagctgct 
ggggcgccaa 
acatcgcgca 
agtacatcgg 
tggccattgg 
gcgatgctga 
tgtgtatcct 
atcccactgt 
ttcaagattc 
aagagacttt 
aaggctcggg 
catcttctgc 
ctgaggagga 
tattaacagt 
acgctctata 
tgaagcttct 
accgccgatg 
gacccaagtt 
atgatgtcct 
agatcgccaa 
acttccgaag 
tccacgacgt 
ttcagaataa 
cagccctggg 
ctgctgggga 
ctgagtgtta 
cttggggtgg 
cccagcctgg 
ccaagaactg 
tatcatttag 
tcttcacctc 
tgctgtttgc 
tcctgtactc 
gggtgaatta 
tagcaggaat 
tcattttctg 
gcagaaactt 
tcctgttcct 
ggcagaatga 
ccatgttcgg 
ccttcactgg 
ggttccccga 
tgctggtcaa 



cccacgcgtc 
aaaaaccgtc 
aaggaatgac 
cttgtcttac 
atgtgtcttc 
ccagagccag 
acacaccaag 
gaaagggaag 
gacccagcac 
gaacttcgcc 
gtccaaaggt 
ggaggtggtg 
catagcagct 
gggctatttt 
ggacaacaac 
cgaagcaaag 
caactatggt 
gaaagccatc 
ccagatcgct 
cgtcaaggag 
gactgagagt 
tattaaaatg 
caaagccttc 
gctggagtgg 
ggagtctgct 
tgtccgcctc 
cactgaactc 
gaattcctat 
aggcttccgg 
gtctcctatt 
gaaggaactc 
agccagcaag 
gtccgaggag 
cagcagcgat 
aagcaactgt 
ggtccagaat 
gaagattatc 
gaagaaacct 
ccccttcgtg 
ctacgtgctg 
gctggtcttt 
ttttactgac 
tgtatttcgg 
tctggactac 
aggacccaag 
ctttgcggtg 
gcagcgctgg 
ccaggtgccc 
gaatgagtcc 
gtggatcacc 
cctgctggtc 



cgcccacgcg 
acttaggaaa 
actctggaca 
agtgaaagcg 
tttaccaaag 
cacatggaag 
gaatttccta 
tatatacgtc 
tggcacctga 
ctgaagccgc 
gcttggattc 
agagataaca 
tggggcatgg 
ttagcccagt 
cacacacatt 
ctccggaatc 
ggcaagatcc 
aatacctcca 
gatgtgatcg 
aagctggtgc 
tggatcaaat 
gaagaagctg 
agcaccagtg 
aaccagctgg 
gaccttcaag 
tttctggaga 
ttctccaacc 
aatgatgccc 
aaggaagaca 
actcggcacc 
tccaaagtca 
cttctgaaga 
ctggctaatg 
gaagacttgg 
ctggagctgg 
tttctttcta 
ctgtgtctgt 
gtcgacaagc 
gtcttctcct 
ctcatggatt 
gtcctcttct 
ctgtggaatg 
ctccactctt 
attattttca 
attataatgc 
tggatggtgg 
aggtggatat 
agtgacgtgg 
aagccactgt 
atccccctgg 
gccatgtttg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
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gctacacggt 
tcctggtgca 
acttctacat 
cttctgtctg 
aggaaaacta 
atcgatttag 
ctaataaaat 
tcatattaag 
acccctgggt 
aagtgtgtga 
gtttctccag 
agtttaagtg 
tataggagaa 
ttttgttcat 
ttggcatatt 
ggaccccgct 
ggaggtgaca 
cccccattcc 
tttgttctta 
attatagaaa 
taaggtaatc 
tgtctcaatt 
gtaagctttt 
aacattatat 
tgaaggctat 
tatttgggaa 
aaaaaaaagg 



gggcaccgtc 
ggagtactgc 
ggtggtgaag 
ctgtttcaaa 
ccttgtcaag 
acaactggat 
caaataaaac 
gaatgctgat 
acatggtgga 
ttggtttcat 
aatggtgcct 
tgttcttacc 
catctatcct 
ttccaattga 
gtwaaaagtc 
ctcttcagga 
aggcagtctc 
taaatatgtg 
ttggatactc 
atttagacca 
actgctaaca 
ctctttcaaa 
tcacttagta 
caataattgc 
ctccagttga 
aattttccaa 
gcggccgctc 



caggagaaca 
agccgcctca 
aagtgcttca 
aatgaagaca 
atcaacacaa 
acaaagctta 
tgtatgaact 
gaacaatttt 
tgattttaaa 
acttgaagac 
gtttctctct 
gcctcctttt 
atgaataaga 
ttctctactt 
tctcaaatta 
aaagtgtttt 
ttgctctctt 
aaaagtcgcc 
ctcttattta 
tacagagatg 
atttctggat 
attttacaga 
ttttatcaaa 
ataataggca 
tcattgggat 
ggttagattc 
tagagt 



-50- 
atgaccaggt 
atatcccctt 
agtgttgctg 
atgagactct 
aagccaacga 
atgatctcaa 
ctaatggaga 
gctatcgact 
tcaccctagt 
ggatataaag 
gtgtctcaat 
tcctttaatc 
acctggtcat 
ttcccttttt 
ggccagattc 
catttctcag 
ggactcacca 
caaaatgcaa 
ttatttttcc 
tagaaagaac 
ggtttttcaa 
atgttatcat 
tatgttttta 
acctctagcg 
gagcatcttt 
caataaatat 



ctggaagttc 
ccccttcatc 
caaggagaaa 
ggcatgggag 
cacctcagag 
gggtcttctg 
aaaatctaat 
actaaatgag 
gtgctgagac 
gaagaatatt 
gcctgggact 
ttatttttga 
gctttactcc 
tgtattatgt 
taaaacatgc 
gatgcttctt 
ggctcctatt 
ccttgaaagg 
attaaaaata 
ataaattgtc 
gtctattttt 
actacatata 
ttatattcat 
attaccataa 
gtgcatgaat 
ctatttatta 



cagaggtact 

gtcttcgctt 

aacatggagt 

ggtgtcatga 

gaaatgaggc 

aaagagattg 

tatagcaaga 

agattttcag 

cttgagaata 

tcctttatgt 

ggaggttgat 

tgaacacata 

tgtattgtta 

gactaattag 

tgcagcaaga 

acctgtcaga 

gaaggaacca 

cactactgac. 

atagctggct 

cccattacct 

tttctatgta 

tactttttat 

agccttctta 

ttttgctcat 

cctattgctg 

ttaaaaaaaa 



3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4646 
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<213> Homo Sapiens 



<400> 32 



Met 


Ser 


Phe 


Arg 


Ala 


Ala 


Arg 


Leu 


Ser 


Met 


Arg 


Asn 


Arg 


Arg 


Asn 


Asp 


1 






5 










10 










15 




Thr 


Leu 


Asp 


Ser 


Thr 


Arg 


Thr 


Leu 


Tyr 


Ser 


Ser 


Ala 


Ser 


Arg 


Ser 


Thr 






20 










25 










30 






Asp 


Leu 


Ser 


Tyr 


Ser 


Glu 


Ser 


Asp 


Leu 


Val 


Asn 


Phe 


He 


Gin 


Ala 


Asn 




35 










40 










45 








Phe 


Lys 


Lys 


Arg 


Glu 


Cys 


Val 


Phe 


Phe 


Thr 


Lys 


Asp 


Ser 


Lys 


Ala 


Thr 




50 








55 










60 










Glu 


Asn 


Val 


Cys 


Lys 


Cys 


Gly 


Tyr 


Ala 


Gin 


Ser 


Gin 


His 


Met 


Glu 


Gly 


65 








70 










75 










80 


Thr 


Gin 


He 


Asn 


Gin 


Ser 


Glu 


Lys 


Trp 


Asn 


Tyr 


Lys 


Lys 


His 


Thr 


Lys 










85 










90 










95 




Glu 


Phe 


Pro 


Thr 


Asp 


Ala 


Phe 


Gly 


Asp 


He 


Gin 


Phe 


Glu 


Thr 


Leu 


Gly 








100 










105 










110 






Lys 


Lys 


Gly 


Lys 


Tyr 


He 


Arg 


Leu 


Ser 


Cys 


Asp 


Thr 


Asp 


Ala 


Glu 


He 






115 










120 










125 








Leu 


Tyr 


Glu 


Leu 


Leu 


Thr 


Gin 


His 


Trp 


His 


Leu 


Lys 


Thr 


Pro 


Asn 


Leu 




130 










135 










140 










Val 


He 


Ser 


Val 


Thr 


Gly 


Gly 


Ala 


Lys 


Asn 


Phe 


Ala 


Leu 


Lys 


Pro 


Arg 


145 










150 










155 










160 


Met 


Arg 


Lys 


He 


Phe 


Ser 


Arg 


Leu 


He 


Tyr 


He 


Ala 


Gin 


Ser 


Lys 


Gly 






165 










170 










175 




Ala 


Trp 


He 


Leu 


Thr 


Gly 


Gly 


Thr 


His 


Tyr 


Gly 


Leu 


Met 


Lys 


Tyr 


He 






180 










185 










190 






Gly 


Glu 


Val 


Val 


Arg 


Asp 


Asn 


Thr 


He 


Ser 


Arg 


Ser 


Ser 


Glu 


Glu 


Asn 



195 200 205 
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lie 


Val 


Ala 


He 


Gly 


I le 




210 










Thr 


Leu 


lie 


Arg 


Asn 


Cys 


225 










O O rt 


Leu 


Met 


Asp 


Asp 


Phe 


Thr 










245 




His 


Thr 


His 


Leu 


Leu 


Leu 








260 






Val 


Glu 


Ala 


Lys 


Leu 


Arg 






275 








Thr 


lie 


Gin 


Asp 


Ser 


Asn 




290 










Ala 


Gin 


Gly 


Gly 


Gly 


Lys 


305 










310 


Lys 


Asn 


Lys 


He 


Pro 


Cys 










325 




Asp 


Val 


He 


Ala 


Ser 


Leu 








340 






Ala 


Val 


Lys 


Glu 


Lys 


Leu 






355 








Leu 


Pro 


Glu 


Glu 


Glu 


Thr 




370 










Leu 


Glu 


Cys 


Ser 


His 


Leu 


385 










390 


Asp 


Glu 


He 


Val 


Ser 


Asn 








405 




Ser 


Thr 


Ser 


Glu 


Gin 


Asp 








420 






Leu 


Leu 


Glu 


Trp 


Asn 


Gin 






435 








Asn 


Asp 


Arg 


Arg 


Trp 


Glu 




450 










Ala 


Leu 


He 


Lys 


Asp 


Arg 


465 










470 


Gly 


Leu 


Asn 


Leu 


Arg 


Lys 










485 




Phe 


Ser 


Asn 


His 


Phe 


Ser 








500 






Lys 


Asn 


Ser 


Tyr 


Asn 


Asp 






515 








Ala 


Asn 


Phe 


Arg 


Arg 


Gly 




530 










Glu 


Met 


Asp 


He 


Glu 


Leu 


545 










550 


Leu 


Gin 


Ala 


Leu 


Phe 


He 










565 




Ser 


Lys 


Val 


He 


Trp 


Glu 








D O U 






Gly 


Ala 


Ser 


Lys 


Leu 


Leu 






595 








Asn 


Ala 


Ala 


Gly 


Glu 


Ser 




610 










Ala 


Val 


Glu 


Leu 


Phe 


Thr 


625 










630 


Glu 


Gin 


Leu 


Leu 


Val 


Tyr 










645 




Leu 


Glu 


Leu 


Ala 


Val 


Glu 








660 






Gly 


Val 


Gin 


Asn 


Phe 


Leu 



675 
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Ala Ala Trp Gly Met Val 
215 220 
Asp Ala Glu Gly Tyr Phe 
235 

Arg Asp Pro Leu Cys He 
250 

Val Asp Asn Gly Cys His 
265 

Asn Gin Leu Glu Lys Tyr 
280 

Tyr Gly Gly Lys He Pro 
295 300 
Glu Thr Leu Lys Ala He 
315 

Val Val Val Glu Gly Ser 
330 

Val Glu Val Glu Asp Ala 
345 

Val Arg Phe Leu Pro Arg 
360 

Glu Ser Trp He Lys Trp 
375 380 
Leu Thr Val He Lys Met 
395 

Ala He Ser Tyr Ala Leu 
410 

Lys Asp Asn Trp Asn Gly 
425 

Leu Asp Leu Ala Asn Asp 
440 

Ser Ala Asp Leu Gin Glu 
455 460 
Pro Lys Phe Val Arg Leu 
475 

Phe Leu Thr His Asp Val 
4 90 

Thr Leu Val Tyr Arg Asn 

505 

Ala Leu Leu Thr Phe Val 
520 

Phe Arg Lys Glu Asp Arg 
535 540 
His Asp Val Ser Pro He 
555 

Trp Ala He Leu Gin Asn 
570 

Gin Thr Arg Gly Cys Thr 
585 

Lys Thr Leu Ala Lys Val 
600 

Glu Glu Leu Ala Asn Glu 
615 620 
Glu Cys Tyr Ser Ser Asp 
635 

Ser Cys Glu Ala Trp Gly 
650 

Ala Thr Asp Gin His Phe 
665 

Ser Lys Gin Trp Tyr Gly 
680 
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Ser 


Asn 


Arg 


Asp 


Leu 


Ala 


Gin 


Tyr 










Leu 


Asp 


Asn 


Asn 






255 




Gly 


His 


Pro 


Thr 




270 






He 


Ser 


Glu 


Arg 


285 








He 


Val 


Cys 


Phe 


Asn 


Thr 


Ser 


He 








320 


Gly 


Gin 


He 


Ala 






335 




Leu 


Thr 


Ser 


Ser 




350 






Thr 


Val 


Ser 


Arg 


365 








Leu 


Lys 


Glu 


He 


Glu 


Glu 


Ala 


Gly 








400 


Tyr 


Lys 


Ala 


Phe 






415 




Gin 


Leu 


Lys 


Leu 




430 






Glu 


He 


Phe 


Thr 


445 








Val 


Met 


Phe 


Thr 


Phe 


Leu 


Glu 


Asn 








480 


Leu 


Thr 


Glu 


Leu 






495 




Leu 


Gin 


He 


Ala 




510 






Trp 


Lys 


Leu 


Val 


525 








Asn 


Gly 


Arg 


Asp 


Thr 


Arg 


His 


Pro 








CI ^ rt 


Lys 


Lys 


Glu 


Leu 






57 5 




Leu 


Ala 


AX a 


Leu 










Lys 


Asn 


Asp 


X xe 


605 








Tyr 


Glu 


Thr 


Arg 


Glu 


Asp 


Leu 


Ala 








640 


Gly 


Ser 


Asn 


Cys 






655 




He 


Ala 


Gin 


Pro 




670 






Glu 


He 


Ser 


Arg 


685 









wo 00/40614 PCT/US99/29996 

-52- 



Asp 


Thr 


Lys 


Asn 


Trp 


Lys 


±±e 




Leu 


Cys 


Leu 


Phe 


He 


He 


Pro 


Leu 


690 








695 










700 










Val 


Gly 


Cys 


Gly 


fne 


va J- 


oer 


.He 


Arg 


ys 


Lys 


Pro 


Val 


Asp 


Lys 


His 


705 






/ xu 










7 15 










720 


Lys 


Lys 


Leu 


Leu 


Trp 


Tyr 


Tyr 


V ax 




Phe 


Phe 


Thr 


Ser 


Pro 


Phe 


Val 






725 




















7 S 




Val 


Phe 


Ser 


Trp 
740 


Asn 


Val 


vaJ. 


irne 


Tyr 

i H ZJ 


T T o 
J. Xc 






Leu 


Leu 
750 




Phe 


Ala 


Tyr 


Val 


Leu 


Leu 


Met 


Asp 


Pne 


nXS 


oer 


VaJ. 


irro 




Pro 


Pr-ri 


Glu 




755 










T cn 










765 








Leu 


Val 


Leu 


Tyr 


Ser 


Leu 


Val 


?ne 


vax 


Leu 


fne 


Cys 


Asp 


Glu 


V a J. 


Arg 




770 








775 










/ o u 










Gin 


Trp 


Tyr 


Val 


Asn 


Gly 


Val 


Asn 


Tyr 


Pne 


Thr 


Asp 


Leu 


Trp 


Asn 




785 






790 










/ yo 










800 


Met 


Asp 


Thr 


Leu 


Gly 


Leu 


Phe 


Tyr 


Phe 


lie 


A 1 -a 

AX a 


Gxy 


He 


Val 


r ne 


Arg 








805 










o xu 










O X ^ 




Leu 


His 


Ser 


Ser 


Asn 


Lys 


Ser 


Ser 


Leu 


Tyr 


Ser 


VaXy 


Arg 


Val 


T 1 

xxe 


irne 








820 








Q O R 










830 






Cys 


Leu 


Asp 


Tyr 


He 


He 


Phe 


Thr 


Leu 


Arg 


Leu 


T 1 A 

xxe 


His 


He 


Irne 


A 11 JT 




835 










Q A f\ 










845 








Val 


Ser 


Arg 


Asn 


Leu 


Gly 


Pro 


Lys 


He 


He 


Met 


Leu 


Gin Arg 


nei. 


x«eu 




850 








855 










DOU 










He 


Asp 


Val 


Phe 


Phe 


Phe 


Leu 


Phe 


Leu 


fne 


AX a 


vax 


Trp 


Met 


V ClX 


Ala 


865 








870 










o / o 










o o u 


Phe 


Gly 


Val 


Ala 


Arg 


Gin 


Gly 


He 


Leu 


Arg 


bxn 


Asn 


Glu 


Gin 


Arg 


Trp 








885 










oyu 










O 7^ 




Arg 


Trp 


lie 


Phe 


Arg 


Ser 


Val 


He 


Tyr 


VaJ-U 


Pro 


Tyr 


Leu 


Ala 




zrxics 




900 


















910 






Gly 


Gin 


Val 


Pro 


Ser 


Asp 


Val 


Asp 


Gly 


Thr 


1 nr 


Tyr 


Asp 


Phe 




His 




915 










920 










925 








Cys 


Thr 


Phe 


Thr 


Gly Asn 


Glu 


Ser 


Lys 


Pro 


Leu 


Cys 


Val 


Glu 


- 

Leu 


Asp 


930 










935 










940 










Glu 


His 


Asn 


Leu 


Pro 


Arg 


Phe 


Pro 


Glu 


Trp 


He 


Thr 


He 


Pro 


Leu 


Val 


945 










950 










955 










960 


Cys 


He 


Tyr 


Met 


Leu 


Ser 


Thr 


Asn 


He 


Leu 


Leu 


Val 


Asn 


Leu 


Leu 


Val 






965 










970 










975 




Ala 


Met 


Phe 


Gly 


Tyr 


Thr 


Val 


Gly 


Thr 


Val 


Gin 


Glu 


Asn 


Asn 


Asp 


Gin 








980 








985 










990 






Val 


Trp 


Lys 


Phe 


Gin 


Arg 


Tyr 


Phe 


Leu 


Val 


Gin 


Glu 


Tyr 


Cys 


Ser 


Arg 




995 










1000 








1005 






Leu 


Asn 


He 


Pro 


Phe 


Pro 


Phe 


He 


Val 


Phe 


Ala 


Tyr 


Phe 


Tyr 


Met 


Val 




1010 








1015 








1020 








Val 


Lys 


Lys 


Cys 


Phe 


Lys 


Cys 


Cys 


Cys 


Lys 


Glu 


Lys 


Asn 


Met 


Glu 


Ser 


1025 








1030 








1035 








104 


Ser 


Val 


Cys 


Cys 


Phe 


Lys 


Asn 


Glu 


Asp 


Asn 


Glu 


Thr 


Leu 


Ala 


Trp 


Glu 






1045 








1050 








1055 


Gly 


Val 


Met 


Lys 


Glu 


Asn 


Tyr 


Leu 


Val 


Lys 


He 


Asn 


Thr 


Lys 


Ala 


Asn 






1060 








1065 








1070 




Asp 


Thr 


Ser 


Glu 


Glu 


Met 


Arg 


His 


Arg 


Phe 


Arg 


Gin 


Leu 


Asp 


Thr 


Lys 



1075 1080 1085 



Leu Asn Asp Leu Lys Gly Leu Leu Lys Glu He Ala Asn Lys He Lys 
1090 1095 1100 
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CHARACTERIZATION OF A CALCIUM CHANNF.I. FAMn.Y 

Field of the Invention 

This invention relates to nucleic acids coding for a novel family of calcium channel 
polypeptides, the encoded polypeptides, unique fragments of the foregoing, and methods of 
making and using same. 

Background of the Invention 

Calcium channels are membrane-spanning, muiti-subunit proteins that facilitate the 
controlled transport ("flux") of Ca^"*^ ions into and out of cells. Cells throughout the animal 
kingdom, and at least some bacterial, fungal and plant cells, possess one or more types of 
calciimi channels. In general, "excitable"' cells, such as neurons of the central nervous 
system, peripheral nerve cells, and muscle cells, including those of skeletal muscles, cardiac 
muscles, and venous and arterial smooth muscles, possess voltage-dependent calciimi 
channels. In a voltage-dependent calcium channel, the transport of Ca^*** ions into and out of 
the cells requires a certain minimal level of depolarization (the difference in potential between 
the inside of the cell bearing the channel and the extracellular environment) vdth the rate of 
Ca^"*" cell flux dependent on the difference in potential. In "non-excitable" cells, calcium 
influx is thought to occur predominantly in response to stimuli which cause the release of 
calcium from intracellular stores. This process, termed store operated calcium influx, is not 
well understood. 

Characterization of a particular type of calcium channel by analysis of whole cells is 
complicated by the presence of mixed populations of different types of calcium channels in 
the majority of cells. Although single-channel recording methods can be used to examine 
individual calcium channels, such analysis does not reveal information related to the 
molecular structure or biochemical composition of the channel. Furthermore, in this type of 
analysis, the channel is isolated from other cellular constituents that might be important for 
the channel's natural functions and pharmacological interactions. To study the calcium 
channel structure-function relationship, large amoimts of pure channel protein are needed. 
However, acquiring large amounts of pure protein is difficult in view of the complex nature of 
these multisubunit proteins, the varying concentrations of calcium channel proteins in tissue 
sources, the presence of mixed populations of calcium channel proteins in tissues, and the 
modifications of the native protein that can occur during the isolation procedure. 
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Summary of the Invention 

The invention is based on the identification of a novel family of calcium channel 
polypeptides and the molecular cloning and partial characterization of a novel member of this 
family that is expressed predominantly in human hematopoietic cells, liver, and kidney. This 
newly identified family of calcium channel polypeptides is designated, "SOC" or "CRAC" or 
"ICRAC", for Store Operated Channels or Calcium Release Activated Channels. Although 
not wishing to be bound to any particular theory or mechanism, it is believed that the 
SOC/CRAC calcium channel polypeptides are transmembrane polypeptides that modulate 
Ca^"" flux "into" and "out of a cell, for example, in certain instances they may be activated 
upon depletion of Ca^^ from intracellular calcium stores, allowing Ca^^ influx into the cell. 
Accordingly, the compositions disclosed herein are believed to be useful for modulating 
calcium transport into and out of such intracellular stores and for the treatment of disorders 
that are characterized by aberrant calcium transport into and out of such intracellular stores. 
In particular, we believe that the SOC/CRAC calcium channel polypeptides disclosed herein 
play an important role in the influx of extracellular calcium by mediating the refilling of 
intracellular calcium stores following their depletion. Accordingly, we believe that the 
compositions for expressing functional SOC/CRAC calcium channel polypeptides in cells, as 
disclosed herein, are useful for treating patients having conditions that are characterized by 
reduced extracellular calcium influx into their SOC/CRAC-expressing cells. Additionally, the 
compositions of the invention are useftil for delivering therapeutic and/or imaging agents to 
cells which preferentially express SOC/CRAC calcium channel polypeptides and, in 
particular, for delivering such agents to hematopoietic cells, liver, heart, spleen, and kidney to 
modulate proliferation and growth of these cells. Moreover, in view of the importance of 
cellular calcium levels to cell viability, we believe that SOC-2/CRAC-1, SOC-3/CRAC-2, and 
SOC-4/CRAC-3 as disclosed herein, and/or other members of the SOC/CRAC family of 
calcium channel polypeptides, represent an ideal target for designing and/or identifying (e.g., 
from molecular libraries) small molecule inhibitors that block lymphocyte proliferation, as 
well as other binding agents that selectively bind to SOC/CRAC polypeptides to which drugs 
or toxins can be conjugated for delivery to SOC/CRAC polypeptide expressing cells. 

The invention is based, in part, on the molecular cloning and sequence analysis of the 
novel SOC/CRAC calcium channel molecules disclosed herein (also referred to as a "SOC- 
2/CRAC-l molecule,'' a "SOC-3/CRAC-2 molecule," and/or "SOC-4/CRAC-3 molecule") 
that are predominantly expressed in human hematopoietic cells, liver, spleen, heart, and 
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kidney (SOC-2/CRAC-1), kidney and colon (SOC-3/CRAC-2), and prostate (SOC-4/CRAC-3 
molecule). As used herein, a "SOC/CRAC molecule" embraces a "SOC/CRAC calcium 
channel nucleic acid" (or "SOC/CRAC nucleic acid") and a "SOC/CRAC calcium channel 
polypeptide" (or "SOC/CRAC polypeptide"). Homologs and alleles also are embraced within 
the meaning of a SOC/CRAC calcium channel molecule. 

According to one aspect of the invention, isolated SOC/CRAC nucleic acids which 
code for one or more member(s) of the SOC/CRAC family of calcium channel polypeptides 
or unique fragments thereof are provided. The isolated nucleic acids refer to one or more of 
the following: 

(a) nucleic acid molecules which hybridize under stringent conditions to a nucleic acid 
molecule selected from the group consisting of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, 
SEQ ID NO:7, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, and SEQ ID 
NO:3 1 , and which code for a SOC/CRAC polypeptide; 

(b) deletions, additions and substitutions of (a) which code for a respective 
SOC/CRAC polypeptide; 

(c) nucleic acid molecules that differ from the nucleic acid molecules of (a) or (b) in 
codon sequence due to the degeneracy of the genetic code, and 

(d) complements of (a), (b) or (c). 

The invention in another aspect provides an isolated nucleic acid molecule selected 
from the group consisting of (a) a unique fragment of a nucleic acid molecule selected from 
the group consisting of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID 
NO:23, SEQ ID NO:25, SEQ ID NO:29, and SEQ ID NO:31, (b) complements of (a), 
provided that the unique fragment includes a sequence of contiguous nucleotides which is not 
identical to any sequence selected from a sequence group consisting of (1) sequences having 
the SEQ. ID NOS. or GenBank accession numbers of Table I, (2) complements of (1), and (3) 
fragments of (1) and (2). 

According to yet another aspect of the invention, isolated SOC/CRAC polypeptides 
are provided. The isolated SOC/CRAC polypeptide molecules are encoded by one or more 
SOC/CRAC nucleic acid molecules of the invention. Preferably, the SOC/CRAC polypeptide 
contains one or more polypeptides selected from the group consisting of the polypeptides 
having SEQ. ID Nos. 2, 4, 6, 8, 24, 26, 28, 30, and 32. In other embodiments, the isolated 
polypeptide may be a fragment or variant of the foregoing SOC/CRAC polypeptide molecules 
of sufficient length to represent a sequence imique within the human genome, and identifying 
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with a polypeptide that functions as a calcium channel, provided that the fragment excludes a 
sequence of contiguous amino acids identified in Table II, and/or excludes a sequence of 
contiguous amino acids encoded for by a nucleic acid sequence identified in Table I. In 
another embodiment, immunogenic fragments of the polypeptide molecules described above 
are provided. 

According to another aspect of the invention, isolated SOC/CRAC binding agents 
(e.g., polypeptides) are provided which selectively bind to a SOC/CRAC molecule (e.g., a 
SOC/CRAC polypeptide encoded by the isolated nucleic acid molecules of the invention). 
Preferably, the isolated binding agents selectively bind to a polypeptide which comprises the 
sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, 
SEQ ID NO:8. SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, and SEQ ID 
NO:32, or unique fragments thereof. In the preferred embodiments, the isolated binding 
polypeptides include antibodies and fragments of antibodies (e.g.. Fab, F(ab)2, Fd and 
antibody fragments which include a CDR3 region which binds selectively to a SOC/CRAC 
polypeptide). Preferably, the antibodies for human therapeutic applications are human 
antibodies. 

According to another aspect of the invention, a pharmaceutical composition containing 
a pharmaceutically effective amount of an isolated SOC/CRAC nucleic acid, an isolated 
SOC/CRAC polypeptide, or an isolated SOC/CRAC binding polypeptide in a 
pharmaceutically acceptable carrier also is provided. The pharmaceutical compositions are 
useful in accordance with therapeutic methods disclosed herein. 

According to yet another aspect of the invention, a method for isolating a SOC/CRAC 
molecule is provided. The method involves: 

a) contacting a SOC/CRAC nucleic acid or a SOC/CRAC binding polypeptide with a 
sample that is believed to contain one or more SOC/CRAC molecules, under conditions to 
form a complex of the SOC/CRAC nucleic acid or the SOC/CRAC binding polypeptide and 
the SOC/CRAC molecule; 

b) detecting the presence of the complex; 

c) isolating the SOC/CRAC molecule from the complex; and 

d) determining whether the isolated SOC/CRAC molecule has SOC/CRAC calciimi 
channel activity. As used herein "SOC/CRAC calcium channel activity" refers to the 
transport of Ca^"^ into and out of intracellular stores that is mediated by a SOC/CRAC 
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polypeptide. In general, the SOC/CRAC calcium channel activity is initiated by a reduction 
or depletion of intracellular calcium stores. 

In certain embodiments, the SOC/CRAC nucleic acid is a SOC-2/CRAC-1 nucleic 
acid (e.g., a nucleic acid having SEQ. ID NO. 27, or complements thereof); in certain other 
embodiments, the SOC/CRAC nucleic acid is a SOC-3/CRAC-2 nucleic acid (e.g., a nucleic 
acid having SEQ. ID NO. 29, or complements thereof); in further embodiments, the 
SOC/CRAC nucleic acid is a SOC-4/CRAC-3 nucleic acid (e.g., a nucleic acid having SEQ. 
ID NO. 31, or complements thereof). In yet other embodiments, the SOC/CRAC polypeptide 
is a SOC-2/CRAC-I binding polypeptide (e.g., an antibody that selectively binds to a SOC- 
2/CRAC-l polypeptide). In yet further embodiments, the SOC/CRAC polypeptide is a SOC- 
3/CRAC-2 binding polypeptide (e.g., an antibody that selectively binds to a SOC-3/CRAC-2 
polypeptide). In some embodiments, the SOC/CRAC polypeptide is a SOC-4/CRAC-3 
binding polypeptide (e.g., an antibody that selectively binds to a SOC-4/CRAC-3 
polypeptide). In the preferred embodiments, the isolated binding polypeptides include 
antibodies and fragments of antibodies (e.g.. Fab, F(ab)2, Fd and antibody fragments which 
include a CDR3 region which binds selectively to a SOC-2/CRAC-1, to a SOC-3/CRAC-2, 
and/or to a SOC-4/CRAC-3 polypeptide). Preferably the isolated binding polypeptides or 
other binding agents selectively bind to a single SOC/CRAC molecule, i.e., are capable of 
distinguishing between different members of the SOC/CRAC family. Accordingly, one or 
more SOC/CRAC binding agents can be contained in a single composition (e.g., a 
pharmaceutical composition) to identify multiple SOC/CRAC molecules in vivo or in vitro. 

According to yet another aspect of the invention, a method for identifying agents 
useful in the modulation of SOC/CRAC calciiun charmel activity is provided. The method 
involves: 

a) contacting a SOC/CRAC polypeptide with a candidate agent suspected of 
modulating SOC/CRAC calcium channel activity, under conditions sufficient to allow the 
candidate agent to interact selectively v^ith (e.g. bind to) the SOC/CRAC polypeptide; 

b) detecting a Ca^"^ concentration of step (b) associated with the SOC/CRAC calcium 
channel activity of the SOC/CRAC polypeptide in the presence of the candidate agent; and 

c) comparing the Ca^"^ concentration of step (b) with a control Ca^"^ concentration of a 
SOC/CRAC polypeptide in the absence of the candidate agent to determine whether the 
candidate agent modulates (increases or decreases) SOC/CRAC calcium channel activity. 
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According to another aspect of the invention, a method for identifying agents useful in 
the modulation of a SOC/CRAC polypeptide kinase activity is provided. The method 
involves: 

a) contacting a SOC/CRAC polypeptide with kinase activity with a candidate agent 
suspected of modulating SOC/CRAC kinase activity, under conditions sufficient to cdlow the 
candidate agent to interact with the SOC/CRAC polypeptide and modulate its kinase activity; 

b) detecting a kinase activity associated with the SOC/CRAC polypeptide in the 
presence of the candidate agent; and 

c) comparing the kinase activity of step (b) v^th a control kinase activity of a 
SOC/CRAC polypeptide in the absence of the candidate agent to determine whether the 
candidate agent modulates (increases or decreases) SOC/CRAC kinase activity. In some 
embodiments the SOC/CRAC polypeptide comprises amino acids 999-1180 of the SOC- 
2/CRAC-l polypeptide (SEQ ID NO:24), or a fragment thereof that retains the kinase activity. 

According to yet another aspect of the invention, a method for determining the level of 
expression of a SOC/CRAC polypeptide in a subject is provided. The method involves: 

a) meastiring the expression of a SOC/CRAC polypeptide in a test sample, and 

b) comparing the measured expression of the SOC/CRAC polypeptide in the test 
sample to the expression of a SOC/CRAC polypeptide in a control containing a known level 
of expression to determine the level of SOC/CRAC expression in the subject. Expression is 
defined as SOC/CRAC mRNA expression or SOC/CRAC polypeptide expression. Various 
methods can be used to measure expression. The preferred embodiments of the invention 
utilize PCR and Northern blotting for measuring mRNA expression, and monoclonal or 
polyclonal SOC/CRAC antisera as reagents for measuring SOC/CRAC polypeptide 
expression. In preferred embodiments, the SOC/CRAC molecule (nucleic acid and/or 
polypeptide) is SOC-2/CRAC-L In other preferred embodiments, the SOC/CRAC molecule 
is SOC-3/CRAC-2. In yet further preferred embodiments, the SOC/CRAC molecule is SOC- 
4/CRAC-3. In certain embodiments, the test samples include biopsy samples and biological 
fluids such as blood. The method is useful, e.g., for assessing the presence or absence or 
stage of a proliferative disorder in a subject. 

The invention also contemplates kits comprising a package including assays for 
SOC/CRAC epitopes, SOC/CRAC nucleic acids, and instructions, and optionally related 
materials such as controls, for example, a number, color chart, or an epitope of the expression 
product of the foregoing isolated nucleic acid molecules of the invention for comparing, for 
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example, the level of SOC/CRAC polypeptides or SOC/CRAC nucleic acid forms (wild-type 
or mutant) in a test sample to the level in a control sample having a known amount of a 
SOC/CRAC nucleic acid or SOC/CRAC polypeptide. This comparison can be used to assess 
in a subject a risk of developing a cancer or the progression of a cancer. The kits may also 
5 include assays for other known genes, and expression products thereof, associated with, for 
example, proliferative disorders (e.g., BRCA, p53, etc.). In a preferred embodiment, the kit 
comprises a package containing: (a) a binding agent that selectively binds to an isolated 
nucleic acid of the invention or an expression product thereof to obtain a measured test value, 
(b) a control containing a known amount of a SOC/CRAC nucleic acid or a SOC/CRAC 
w polypeptide to obtain a measured control value, and (c) instructions for comparing the 
measured test value to the measured control value to determine the amount of SOC/CRAC 
nucleic acid or expression product thereof in a sample. 

The invention provides isolated nucleic acid molecules, unique fragments thereof, 
expression vectors containing the foregoing, and host cells cont£dning the foregoing. The 
15 invention also provides isolated binding polypeptides and binding agents which bind such 
polypeptides, including antibodies, and pharmaceutical compositions containing any of the 
compositions of the invention. The foregoing can be used, inter alia, in the diagnosis or 
treatment of conditions characterized by the aberrant ejqiression levels and/or the presence of 
mutant forms of a SOC/CRAC nucleic acid or polypeptide. The invention also provides 
20 methods for identifying agents that alter the function of the SOC/CRAC polypeptide. 

These and other aspects of the invention, as well as various advantages and utilities, 
will be more apparent with reference to the detailed description of the preferred embodiments. 

Brief Description of the Sequences 
SEQ ID NO: 1 is a partial nucleotide sequence of the human SOC-2/CRAC-1 cDNA. 
25 SEQ ID NO:2 is the predicted amino acid sequence of the translation product of 

human SOC-2/CRAC-1 cDNA (SEQ ID NO:l). 

SEQ ID NO:3 is a partial nucleotide sequence of the human SOC-2/CRAC-1 cDNA. 
SEQ ID NO:4 is the predicted amino acid sequence of the translation product of 
human SOC-2/CRAC-1 cDNA (SEQ ID NO:3). 
30 SEQ ID NO:5 is a partial nucleotide sequence of the human SOC-2/CRAC-1 cDNA. 

SEQ ID NO:6 is the predicted amino acid sequence of the translation product of 
human SOC-2/CRAC-1 cDNA (SEQ ID NO:5). 
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SEQ ID NO:7 is a partial nucleotide sequence of the mouse homologue (mSOC- 
2/CRAC-l) of the human SOC-2/CRAC-1 cDNA. 

SEQ ID NO: 8 is the predicted amino acid sequence of the translation product of the 
mSOC-2/CRAC-l cDNA (SEQ ID NO:7). 

SEQ ID NO:9 is the nucleotide sequence of the mouse MLSN-1 (SOC-1) cDNA. 

SEQ ID NO: 10 is the predicted amino acid sequence of the translation product of the 
mouse MLSN-1 (SOC-1) cDNA (SEQ ID NO:9). 

SEQ ID NO: 11 is the nucleotide sequence of a human calciimi charmel cDNA with 
GenBank Acc. no.: AB001535. 

SEQ ID NO: 12 is the predicted amino acid sequence of the translation product of the 
human calcium channel cDNA with GenBank Acc. no.: AB001535 (SEQ ID NO:ll). 

SEQ ID NO: 13 is the amino acid sequence of a C. Elegans polypeptide at the 
c05cl2.3 locus. 

SEQ ID NO: 14 is the amino acid sequence of a C Elegans polypeptide at the F54D1 

locus. 

SEQ ID NO: 15 is the amino acid sequence of a C Elegans polypeptide at the t01H8 

locus. 

SEQ ID NO: 16 is the nucleotide sequence of a mouse kidney cDNA with GenBank 
Acc. no.: AI226731. 

SEQ ID NO: 17 is the predicted amino acid sequence of the translation product of the 
mouse kidney cDNA with GenBank Acc. no.: AI226731 (SEQ ID NO: 16). 

SEQ ID NO: 18 is the nucleotide sequence of a human brain cDNA with GenBank 
Acc. no.: H18835. 

SEQ ID NO: 19 is the predicted amino acid sequence of the translation product of the 
human brain cDNA with GenBank Acc. no.: H18835 (SEQ ID NO:18). 

SEQ ID NO:20 is the nucleotide sequence of the human EST with GenBank Acc. no.: 
AA4 19592. 

SEQ ID NO:21 is the nucleotide sequence of the human EST with GenBank Acc. no.: 
AA419407. 

SEQ ID NO:22 is the nucleotide sequence of the mouse EST with GenBank Acc. no.: 
AI098310. 
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SEQ ID NO:23 is a partial nucleotide sequence of the human SOC-2/CRAC-1 cDNA 
that contains the SOC-2/CRAC-1 sequences of SEQ ID NO:l, SEQ ID NO:3, and SEQ ID 
NO:5. 

SEQ ID NO:24 is the predicted amino acid sequence of the translation product of 
human SOC-2/CRAC-1 cDNA (SEQ ID NO:23). 

SEQ ID NO:25 is a partial nucleotide sequence of the human SOC-3/CRAC-2 cDNA. 

SEQ ID NO:26 is the predicted amino acid sequence of the translation product of 
human SOC-3/CRAC-2 cDNA (SEQ ID NO:25). 

SEQ ID NO:27 is the full nucleotide sequence of the human SOC-2/CRAC-1 cDNA. 

SEQ ID NO:28 is the predicted amino acid sequence of the translation product of 
human SOC.2/CRAC-1 cDNA (SEQ ID NO:27). 

SEQ ID NO:29 is the full nucleotide sequence of the human SOC-3/CRAC-2 cDNA. 

SEQ ID NO:30 is the predicted amino acid sequence of the translation product of 
human SOC-3/CRAC-2 cDNA (SEQ ID NO:29). 

SEQ ID NO:31 is the full nucleotide sequence of the human SOC-4/CRAC-3 cDNA. 

SEQ ID NO:32 is the predicted amino acid sequence of the translation product of 
human SOC-4/CRAC-3 cDNA (SEQ ID NO:3 1 ). 

Brief Description of the Drawings 

Figure 1 is a schematic depicting the intron/exon organization of the chicken SOC- 
2/CRAC-l genomic sequence, as well as the putative transmembrane (TM) domains, and the 
targeting constructs utilized in the knockout experiments. 

Detailed Description of the Invention 

One aspect of the invention involves the partial cloning of cDNAs encoding members 
of a novel family of calcium channel polypeptides, referred to herein as "SOC/CRAC" 
(designated "SOC" or "CRAC" or "ICRAC", for Store Operated Channels or Calcium 
Release Activated Channels, or CECH). Although not intending to be bound to any particular 
mechanism or theory, we believe that a SOC/CRAC family member is a transmembrane 
calcium chaimel that modulates Ca^^ flux "into" and "out of a cell; in certain instances it may 
be activated upon depletion of Ca^"*" from intracellular calcium stores, allowing Ca^"^ influx 
into the cell. 

The first three isolated SOC/CRAC members disclosed herein, define a new family of 
calcitmi chaimels which is distinct from previously described calcium channels, such as 
voltage gated calcium channels, ryanodine receptor/inositol-I,4,5-tripho5phate receptor 
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channels, and Transient Receptor Potential (TRP) channels. The SOC/CRAC family of 
calcium channels exhibits high selectivity (with a Pca/PNa ratio near 1000), a unitary 
conductance below the detection level of the patch clamp method (the conductance estimated 
at approximately 0.2 picosiemens), and are subject to inhibition by high intracellular calcium 
levels. Although not intending to be bound to any particular mechanism or theory, we believe 
that SOC/CRAC cedciimi channels are responsible for the majority of, for example, calcium 
entry which occurs when intracellular calcium stores are depleted, and that SOC/CRAC 
currents are important for initiating various types of calcium -dependent processes. Thus, we 
believe that SOC/CRAC calcium channels play an important role in cellular calcium 
homeostasis by, e.g., modulating the supply of calcium to refill intracellular stores when 
depleted. 

The isolated full-length sequence of a representative, first member of the SOC/CRAC 
family, human SOC/CRAC nucleic acid (cDNA), SOC-2/CRAC-1, is represented as the 
nucleic acid of SEQ ID NO:27. This nucleic acid sequence codes for the SOC-2/CRAC-1 
polypeptide with the predicted amino acid sequence disclosed herein as SEQ ID NO:28. A 
homologous mouse cDNA sequence (>90% identity to the human at the nucleotide level) is 
represented as the nucleic acid of SEQ ID NO:7, and codes for a unique fragment of a mouse 
SOC-2/CRAC-1 polypeptide having the predicted, partial amino acid sequence represented as 
SEQ ID NO:8. Analysis of the SOC-2/CRAC-1 partial sequence by comparison to nucleic 
acid and protein databases show that SOC-2/CRAC-1 shares a limited homology to mouse 
MLSN-1 (SOC-1, SEQ ID NOs: 9 and 10). Limited homology is also shared between SOC- 
2/CRAC-l and three C. Elegans polypeptides (SEQ ID NOs: 13, 14, and 15), We further 
believe that SOC-2/CRAC-1 plays a role in the regulation of cellular Ca^^ fluxing and, in 
particular, lymphocyte Ca^"^ fluxing. 

A second member of the human SOC/CRAC family of calcium charmels, SOC- 
3/CRAC-2, is represented as the nucleic acid of SEQ ID NO:29, and codes for the human 
SOC-3/CRAC-2 polypeptide having the predicted amino acid sequence represented as SEQ 
ID NO:30 (this molecule may also be referred to as CECH2). SOC-3/CRAC-2 is 
predominantly expressed in human hematopoietic cells (including peripheral blood 
lymphocytes, liver, bone marrow, spleen, thymus, lymph nodes, heart, and kidney. 
Expression can also be detected (at lesser levels) in brain, skeletal muscle colon, small 
intestine, placenta, lung, and cells (cell lines) such as HL-60, HeLa, K562, MOLT-4, SW-480, 
A459,andG361. 
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A third member of the human SOC/CRAC family of calcium chamiels, SOC- 
4/CRAC-3, is represented as the nucleic acid of SEQ ID NO:31, and codes for the human 
SOC-4/CRAC-3 polypeptide having the predicted amino acid sequence represented as SEQ 
ID NO:32 (this molecule may also be referred to as CECH6). It specifically expressed in the 
prostate gland/cells. 

As used herein, a SOC/CRAC calcium channel nucleic acid (also referred to herein as 
a "SOC/CRAC nucleic acid" refers to a nucleic acid molecule which: (1) hybridizes under 
stringent conditions to one or more of the nucleic acids having the sequences of SEQ. ID 
NOS. 7, 27, 29, and/or 31 (sequences of the mouse and human SOC-2/CRAC-1, human SOC- 
3/CRAC-2, and human SOC-4/CRAC-3 nucleic acids), and (2) codes for a SOC-2/CRAC-1, a 
SOC-3/CRAC-2 or a SOC-4/CRAC-3 calcium channel polj^eptide, respectively, or unique 
fragments of said SOC-2/CRAC-1, SOC-3/CRAC-2, or SOC-4/CRAC-3 polypeptide. 

As used herein, a SOC/CRAC calcium channel polypeptide (also referred to herein as 
a "SOC/CRAC polypeptide") refers to a polypeptide that is coded for by a SOC-2/CRAC-1 , a 
SOC-3/CRAC-2, and/or a SOC-4/CRAC-3 nucleic acid. Preferably, the above-identified 
SOC/CRAC polypeptides mediate transport of caiciun\into and out of a cell. 

SOC/CRAC polypeptides also are useful as immimogenic molecules for the 
generation of binding polypeptides (e.g., antibodies) which bind selectively to SOC/CRAC 
(e.g., SOC-2/CRAC-1, SOC-3/CRAC-2, and/or SOC-4/CRAC-3) polypeptides. Such 
antibodies can be used in diagnostic assays to identify and/or quantify the presence of a 
SOC/CRAC polypeptide in a sample, such as a biological fluid or biopsy sample. 
SOC/CRAC polypeptides further embrace functionally equivalent fragments, variants, and 
analogs of the preferred SOC/CRAC polypeptides, provided that the fi-agments, variants, and 
analogs also are useful in mediating calcium transport into and out of intracellular calcium 
stores. 

As used herein, "SOC/CRAC calcium channel activity" refers to Ca^"*" transport ("Ca^"*^ 
fluxing") across the plasma membrane that is mediated by a SOC/CRAC calcium channel 
polypeptide. The SOC/CRAC calciimi channel polypeptide typically has one or more of the 
following properties: high selectivity, a unitary conductance below the detection level of the 
patch clamp method,and are subject to inhibition by high intracellular calcium levels. Such 
activity can be easily detected using standard methodology well known in the art. See, e.g., 
the Examples and Neher, E., "Ion channels for communication between and within cells". 
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Science. 1992 : 256:498-502: and Hoth, M.. and Penner, R., ^'Depletion of intracellular 
calcium stores activates a calcium current in mast cells", Nature, 1992 : 355 (6358):353-6. 

According to one aspect of the invention, isolated nucleic acid molecules which code 
for one or more member(s) of the SOC/CRAC family of calcium channel polypeptides are 
provided. The isolated nucleic acid molecules are selected from the following groups: 

(a) nucleic acid molecules which hybridize under stringent conditions to one or more 
nucleic acid molecules selected from the group consisting of SEQ ID NO:l, SEQ ID NO:3, 
SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID 
NO:29, and SEQ ID NO:3 1 , and which code for a SOC/CRAC polypeptide; 

(b) deletions, additions and substitutions of (a) which code for a respective 
SOC/CRAC polypeptide; 

(c) nucleic acid molecules that diifer from the nucleic acid molecules of (a) or (b) in 
codon sequence due to the degeneracy of the genetic code, and 

(d) complements of (a), (b) or (c). 

In certain embodiments, the isolated nucleic acid molecule comprises one or more of 
nucleotides 1-1212 of SEQ ID NO:l; nucleotides 1-739 of SEQ ID NO:3; nucleotides 1-1579 
of SEQ ID NO:5; nucleotides 1-5117 of SEQ ID NO:23; the mouse homolog for SOC- 
2/CRAC-l corresponding to SEQ ID NO:7; nucleotides 1-2180 of SEQ ID NO:25; 
nucleotides 382-5976 of SEQ ID NO:27; nucleotides 73-3714 of SEQ ID NO:29; and 
nucleotides 23-3434 of SEQ ID NO:31. In yet other embodiments, the isolated nucleic acid 
molecule comprises a molecule which encodes a polypeptide having one or more sequences 
selected from the group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID 
NO:8, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, and SEQ ID NO:32. 

According to yet another aspect of the invention, an isolated nucleic acid molecule is 
provided which is selected from the group consisting of: 

(a) a imique fragment of a nucleic acid molecule selected from the group consisting of 
SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:23, SEQ ID NO:25, 
SEQ ID NO:27, SEQ ID NO:29, and SEQ ID NO:31, (of sufficient length to represent a 
sequence unique within the human genome); and (b) complements of (a), provided that the 
unique fragment includes a sequence of contiguous nucleotides which is not identical to a 
sequence in the prior art as represented by the sequence group consisting of: (1) sequences 
having the SEQ ID NOs or GenBank accession numbers of Table I, (2) complements of (1), 
and (3) fragments of (1) and (2). 
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In some embodiments, the sequence of contiguous nucleotides is selected from the 
group consisting of (1) at least two contiguous nucleotides nonidentical to the sequence 
group, (2) at least three contiguous nucleotides nonidentical to the sequence group, (3) at least 
four contiguous nucleotides nonidentical to the sequence group, (4) at least five contiguous 
nucleotides nonidentical to the sequence group, (5) at least six contiguous nucleotides 
nonidentical to the sequence group, (6) at least seven contiguous nucleotides nonidentical to 
the sequence group. 

In other embodiments, the unique fragment has a size selected from the group 
consisting of at least: 8 nucleotides, 10 nucleotides, 12 nucleotides, 14 nucleotides, 16 
nucleotides, 18 nucleotides, 20, nucleotides, 22 nucleotides, 24 nucleotides, 26 nucleotides, 
28 nucleotides, 30 nucleotides, 40 nucleotides, 50 nucleotides, 75 nucleotides, 100 
nucleotides, 200 nucleotides, 1000 nucleotides and every integer length therebetween. 

According to another aspect of the invention, expression vectors and host cells 
containing (e.g., transformed or transfected with) expression vectors comprising the nucleic 
acid molecules disclosed herein operably linked to a promoter are provided. In certain 
preferred embodiments, the host cells are eukaryotic cells. 

The isolated nucleic acid molecules disclosed herein have various utilities, including 
their use as probes and primers to identify additional members of the SOC/CRAC family of 
calciimi channels, as diagnostic reagents for identifying the presence of SOC/CRAC 
polypeptides in biological or other samples, and as agents for generating SOC/CRAC binding 
polypeptides (e.g., antibodies) that can be used as reagents in diagnostic and therapeutic 
assays to identify the presence, absence, and/or amounts of a SOC/CRAC nucleic acid or 
polypeptide in a biological or other sample. 

As used herein with respect to nucleic acids, the term "isolated" means: (i) amplified 
in vitro by, for example, polymerase chain reaction (PCR); (ii) recombinantly produced by 
cloning; (iii) purified, as by cleavage and gel separation; or (iv) synthesized by, for example, 
chemical synthesis. An isolated nucleic acid is one which is readily manipulatable by 
recombinant DNA techniques well known in the art. Thus, a nucleotide sequence contained 
in a vector in which 5' and 3' restriction sites are known or for which polymerase chain 
reaction (PCR) primer sequences have been disclosed is considered isolated but a nucleic acid 
sequence existing in its native state in its natural host is not. An isolated nucleic acid may be 
substantially purified, but need not be. For example, a nucleic acid that is isolated within a 
cloning or expression vector is not pure in that it may comprise only a tiny percentage of the 
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material in the cell in which it resides. Such a nucleic acid is isolated, however, as the term is 
used herein because it is readily manipulatable by standard techniques known to those of 
ordinary skill in the art. 

As used herein with respect to polypeptides (discussed below), the term "isolated" 
means separated from its native environment in sufficiently pure form so that it can be 
manipulated or used for any one of the purposes of the invention. Thus, isolated means 
sufficiently pure to be used (i) to raise and/or isolate antibodies, (ii) as a reagent in an assay, 
or (iii) for sequencing, etc. 

Homologs and alleles of the SOC/CRAC nucleic acids of the invention can be 
identified by conventional techniques. Thus, an aspect of the invention is those nucleic acid 
sequences which code for SOC/CRAC polypeptides and which hybridize to a nucleic acid 
molecule selected from a group consisting of the nucleic acid of SEQ ID NO:l, the nucleic 
acid of SEQ ID NO:3, the nucleic acid of SEQ ID NO:5, the nucleic acid of SEQ ID NO:7, 
the nucleic acid of SEQ ID NO:23, the nucleic acid of SEQ ID NO:25, the nucleic acid of 
SEQ ID NO:27, the nucleic acid of SEQ ID NO:29, and the nucleic acid of SEQ ID NO:31, 
under stringent conditions. The term ^'stringent conditions*' as used herein refers to 
parameters with which the art is familiar. Nucleic acid hybridization parameters may be 
foimd in references which compile such methods, e.g. Molecular Cloning: A Laboratory 
Manual, J. Sambrook, et al., eds.. Second Edition, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, New York, 1989, or Current Protocols in Molecular Biology, F.M. Ausubel, 
et al., eds., John Wiley & Sons, Inc., New York. More specifically, stringent conditions, as 
used herein, refers, for example, to hybridization at 65**C in hybridization buffer (3.5 x SSC, 
0.02% Ficoll, 0.02% polyvinyl pyrolidone, 0.02% Bovine Serum Albumin, 2.5mM 
NaH2P04(pH7), 0.5% SDS, 2mM EDTA). SSC is 0.1 5M sodium chloride/0. 15M sodium 
citrate, pH7; SDS is sodium dodecyl sulphate; and EDTA is ethylenediaminetetracetic acid. 
After hybridization, the membrane upon which the DNA is transferred is washed at 2 x SSC 
at room temperature and then at 0.1 x SSC/0.1 x SDS at temperatures up to 68^C. 

There are other conditions, reagents, and so forth which can be used, and would result 
in a similar degree of stringency. The skilled artisan will be familiar with such conditions, 
and thus they are not given here. It will be understood, however, that the skilled artisan will 
be able to manipulate the conditions in a manner to permit the clear identification of 
homologs and alleles of the SOC/CRAC nucleic acids of the invention. The skilled artisan 
also is familiar with the methodology for screening cells and libraries for expression of such 
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molecules which then are routinely isolated, followed by isolation of the pertinent nucleic 
acid molecule and sequencing. 

In general homologs and alleles typically will share at least 40% nucleotide identity 
and/or at least 50% amino acid identity to SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ 
ID NO:7, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, and/or SEQ ID 
NO:31, and SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:24, 
SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, and/or SEQ ID NO:32, respectively. In 
some instances sequences will share at least 50% nucleotide identity and/or at least 65% 
amino acid identity and in still other instances sequences will share at least 60% nucleotide 
identity and/or at least 75% amino acid identity. The homology can be calculated using 
various, publicly available software tools developed by NCBI (Bethesda, Maryland) that can 
be obtained through the internet (ftp.7ncbi.nlm.nih.gov/pub/). Exemplary tools include the 
BLAST system available at http://www.ncbi.nlm.nih.gov. Pairwise and ClustalW alignments 
(BLOSUM30 matrix setting) as well as Kyte-Doolittle hydropathic analysis can be obtained 
using the MacVetor sequence analysis software (Oxford Molecular Group). Watson-Crick 
complements of the foregoing nucleic acids also are embraced by the invention. 

In screening for SOC/CRAC related genes, such as homologs and alleles of SOC- 
2/CRAC-l and/or SOC-3/CRAC-2, a Southern blot may be performed using the foregoing 
conditions, together with a radioactive probe. After washing the membrane to which the 
DNA is finally transferred, the membrane can be placed against X-ray film or a 
phosphoimager plate to detect the radioactive signal. 

Given that the expression of the SOC/CRAC gene is prominent in certain human 
tissues (e.g., SOC-2/CRAC-1: lymphoid tissue/heart, SOC-3/CRAC-2: kidney/colon, SOC- 
4/CRAC-3: prostate), and given the teachings herein of partial human SOC/CRAC cDNA 
clones, full-length and other mammalian sequences corresponding to the human SOC/CRAC 
partial nucleic acid sequences can be isolated fi-om, for example, a cDNA library prepared 
from one or more of the tissues in which SOC-2/CRAC-1 expression is prominent, SOC- 
3/CRAC-2 is prominent, and/or SOC-4/CRAC-3 expression is prominent, using standard 
colony hybridization techniques. 

The invention also includes degenerate nucleic acids which include alternative codons 
to those present in the native materials. For example, serine residues are encoded by the 
codons TCA, AGT, TCC, TCG, TCT and AGC. Each of the six codons is equivalent for the 
purposes of encoding a serine residue. Thus, it will be apparent to one of ordinary skill in the 
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art that any of the serine-encoding nucleotide triplets may be employed to direct the protein 
synthesis apparatus, in vitro or in vivo, to incorporate a serine residue into an elongating 
SOC/CRAC polypeptide. Similarly, nucleotide sequence triplets which encode other amino 
acid residues include, but are not limited to: CCA, CCC, CCG and CCT (proline codons); 
CGA, CGC, COG, CGT, AGA and AGG (arginine codons); ACA, ACC, ACG and ACT 
(threonine codons); AAC and AAT (asparagine codons); and ATA, ATC and ATT (isoleucine 
codons). Other amino acid residues may be encoded similarly by multiple nucleotide 
sequences. Thus, the invention embraces degenerate nucleic acids that differ from the 
biologically isolated nucleic acids in codon sequence due to the degeneracy of the genetic 
code. 

The invention also provides isolated unique fragments of an isolated nucleic acid 
molecule selected from the group consisting of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, 
SEQ ID NO:7, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, and SEQ ID 
NO:31. A unique fragment is one that is a 'signature' for the larger nucleic acid. For 
example, the unique fragment is long enough to assure that its precise sequence is not found 
in molecules within the himian genome outside of the SOC/CRAC nucleic acids defined 
above (and human alleles). Those of ordinary skill in the art may apply no more than routine 
procedures to determine if a fiagnient is unique v^thin the human genome. 

Unique fragments, however, exclude fragments completely composed of the 
nucleotide sequences of any of GenBank accession numbers and SEQ ID NOs listed in Table 
I (SEQ ID NO:9, AB001535, AI226731, H18835, AA419592, AA261842, AA419407, 
AI098310, AA592910, D86107, AF071787, Z77132, Z83117, Z68333, AA708532, 
AA551759, AA932133, R47363, N31660, AC005538, AA654650, AA370110, AA313170, 
AA493512, AI670079, A1671853, AC005538, AA654650, AA370110, AA313170, 
AA493512, AI670079, AI671853), or other previously published sequences as of the filing 
date of this application. 

A fragment which is completely composed of the sequence described in the foregoing 
GenBank deposits and SEQ ID NO:9, is one which does not include any of the nucleotides 
vmique to the sequences of the invention. Thus, a unique fragment must contain a nucleotide 
sequence other than the exact sequence of those in GenBank or fragments thereof. The 
difference may be an addition, deletion or substitution with respect to the GenBank sequence 
or it may be a sequence wholly separate from the GenBank sequence. 
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Unique fragments can be used as probes in Southern and Northern blot assays to 
identify such nucleic acids, or can be used in amplification assays such as those employing 
PGR. As known to those skilled in the art, large probes such as 200, 250, 300 or more 
nucleotides are preferred for certain uses such as Southern and Northern blots, while smaller 
fragments will be preferred for uses such as PGR. Unique fragments also can be used to 
produce fusion proteins for generating antibodies or determining binding of the polypeptide 
fragments, as demonstrated in the Examples, or for generating immunoassay components. 
Likewise, unique fragments can be employed to produce nonflised fragments of the 
SOG/CRAC polypeptides, useful, for example, in the preparation of antibodies, 
immxmoassays or therapeutic applications. Unique fragments further can be used as antisense 
molecules to inhibit the expression of SOG/GRAG nucleic acids and polypeptides, 
respectively. 

As will be recognized by those skilled in the art, the size of the unique fragment will 
depend upon its conservancy in the genetic code. Thus, some regions of SEQ ID NO:l, SEQ 
ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, 
SEQ ID NO:29, and SEQ ID NO:31, and complements thereof, will require longer segments 
to be unique while others will require only short segments, typically between 12 and 32 
nucleotides long (e.g. 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 
31 and 32 bases) or more, up to the entire length of the disclosed sequence. As mentioned 
above, this disclosure intends to embrace each and every fragment of each sequence, 
beginning at the first nucleotide, the second nucleotide and so on, up to 8 nucleotides short of 
the end, and ending anywhere from nucleotide number 8, 9, 10 and so on for each sequence, 
up to the very last nucleotide, (provided the sequence is unique as described above). Virtually 
any segment of the region of SEQ ID NO:l beginning at nucleotide 1 and ending at 
nucleotide 1212, or SEQ ID NO:3 beginning at nucleotide 1 and ending at nucleotide 739, or 
SEQ ID NO:5 beginning at nucleotide 1 and ending at nucleotide 1579, or SEQ ID NO:7 
beginning at nucleotide 1 and ending at nucleotide 3532, or SEQ ID NO:23 beginning at 
nucleotide 1 and ending at nucleotide 5117, SEQ ID NO:25 beginning at nucleotide 1 and 
ending at nucleotide 2180, SEQ ID NO:27 beginning at nucleotide 1 and ending at nucleotide 
7419, or SEQ ID NO:29 beginning at nucleotide 1 and ending at nucleotide 4061, or SEQ ID 
NO:3 1 beginning at nucleotide 1 and ending at nucleotide 4646, or complements thereof, that 
is 20 or more nucleotides in length will be unique. Those skilled in the art are well versed in 
methods for selecting such sequences, typically on the basis of the ability of the unique 
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fragment to selectively distinguish the sequence of interest from other sequences in the human 
genome of the fragment to those on known databases typically is edl that is necessary, 
although in vitro confirmatory hybridization and sequencing analysis may be performed. 

As mentioned above, the invention embraces antisense oligonucleotides that 
selectively bind to a nucleic acid molecule encoding a SOC/CRAC polypeptide, to decrease 
SOC/CRAC calcium channel activity. When using antisense preparations of the invention, 
slow intravenous administration is preferred. 

As used herein, the term "antisense oligonucleotide" or "antisense'* describes an 
oligonucleotide that is an oligoribonucleotide, oligodeoxyribonucleotide, modified 
oligoribonucleotide, or modified oligodeoxyribonucleotide which hybridizes imder 
physiological conditions to DNA comprising a particular gene or to an mRNA transcript of 
that gene and, thereby, inhibits the transcription of that gene and/or the translation of that 
mRNA. The antisense molecules are designed so as to interfere with transcription or 
translation of a target gene upon hybridization with the target gene or transcript. Those 
skilled in the art will recognize that the exact length of the antisense oligonucleotide and its 
degree of complementarity with its target will depend upon the specific target selected, 
including the sequence of the target and the particular bases which comprise that sequence. It 
is preferred that the antisense oligonucleotide be constructed and arranged so as to bind 
selectively with the target under physiological conditions, i.e., to hybridize substantially more 
to the target sequence than to any other sequence in the target cell imder physiological 
conditions. Based upon SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID 
NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, and SEQ ID NO:31, or upon allelic 
or homologous genomic and/or cDNA sequences, one of skill in the art can easily choose and 
synthesize any of a number of appropriate antisense molecules for use in accordance with the 
present invention. In order to be sufficiently selective and potent for inhibition, such 
antisense oligonucleotides should comprise at least 10 and, more preferably, at least 15 
consecutive bases which are complementary to the target, although in certain cases modified 
oligonucleotides as short as 7 bases in length have been used successfully as antisense 
oligonucleotides (Wagner et al., Nat. Med 1(11):1 1 16-1 1 18, 1995). Most preferably, the 
antisense oligonucleotides comprise a complementary sequence of 20-30 bases. Although 
oligonucleotides may be chosen which are antisense to any region of the gene or mRNA 
transcripts, in preferred embodiments the antisense oligonucleotides correspond to N-terminal 
or 5' upstream sites such as translation initiation, transcription initiation or promoter sites. In 
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addition, 3 '-untranslated regions may be targeted by antisense oligonucleotides. Targeting to 
mRNA splicing sites has also been used in the art but may be less preferred if alternative 
mRNA splicing occurs. In addition, the antisense is targeted, preferably, to sites in which 
mRNA secondary structure is not expected (see, e.g., Sednio et al.. Cell Mol NeurobioL 
14(5):439-457, 1994) and at which proteins are not expected to bind. Finally, although, SEQ 
ID No:l discloses a cDNA sequence, one of ordinary skill in the art may easily derive the 
genomic DNA corresponding to this sequence. Thus, the present invention also provides for 
antisense oligonucleotides which are complementary to the genomic DNA corresponding to 
SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5,SEQ ID NO:7, SEQ ID NO:23, SEQ ID NO:25, 
SEQ ID NO:27, SEQ ID NO:29, and SEQ ID NO:31. Similarly, antisense to allelic or 
homologous SOC/CRAC cDNAs and genomic DNAs are enabled without undue 
experimentation. 

In one set of embodiments, the antisense oligonucleotides of the invention may be 
composed of "natural" deoxyribonucleotides, ribonucleotides, or any combination thereof. 
That is, the 5' end of one native nucleotide and the 3' end of another native nucleotide may be 
covalently linked, as in natural systems, via a phosphodiester intemucleoside linkage. These 
oligonucleotides may be prepared by art recognized methods which may be carried out 
manually or by an automated synthesizer. They also may be produced recombinantly by 
vectors. 

In preferred embodiments^ however, the antisense oligonucleotides of the invention 
also may include "modified" oligonucleotides. That is, the oligonucleotides may be modified 
in a nimiber of ways which do not prevent them from hybridizing to their target but ^^ch 
enhance their stability or targeting or which otherwise enhance their therapeutic effectiveness. 

The term "modified oligonucleotide" as used herein describes an oligonucleotide in 
which (1) at least two of its nucleotides are covalently linked via a synthetic intemucleoside 
linkage (i.e., a linkage other than a phosphodiester linkage between the 5' end of one 
nucleotide and the 3' end of another nucleotide) and/or (2) a chemical group not normally 
associated with nucleic acids has been covalently attached to the oligonucleotide. Preferred 
synthetic intemucleoside linkages are phosphorothioates, alkylphosphonates, 
phosphorodithioates, phosphate esters, alkylphosphonothioates, phosphoramidates, 
carbamates, carbonates, phosphate triesters, acetamidates, carboxymethyl esters and peptides. 

The term "modified oligonucleotide" also encompasses oligonucleotides with a 
covalently modified base and/or sugar. For example, modified oligonucleotides include 
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oligonucieotides having backbone sugars which are covalently attached to low molecular 
weight organic groups other than a hydroxyl group at the 3' position and other than a 
phosphate group at the S' position. Thus modified oligonucleotides may include a 2'-0- 
alkylated ribose group. In addition, modified oligonucleotides may include sugars such as 
arabinose instead of ribose. The present invention, thus, contemplates pharmaceutical 
preparations containing modified antisense molecules that are complementary to and 
hybridizable with, under physiological conditions, nucleic acids encoding SOC/CRAC 
polypeptides, together with pharmaceutically acceptable carriers. Antisense oligonucleotides 
may be administered as part of a pharmaceutical composition. Such a pharmaceutical 
composition may include the antisense oligonucleotides in combination with any standard 
physiologically and/or pharmaceutically acceptable carriers which are known in the art. The 
compositions should be sterile and contain a therapeutically effective amoimt of the antisense 
oligonucleotides in a unit of weight or volume suitable for administration to a patient. The 
term "pharmaceutically acceptable" means a non-toxic material that does not interfere with 
the effectiveness of the biological activity of the active ingredients. The term 
'^physiologically acceptable" refers to a non-toxic material that is compatible with a biological 
system such as a cell, cell culture, tissue, or organism. The characteristics of the carrier will 
depend on the route of administration. Physiologically and pharmaceutically acceptable 
carriers include diluents, fillers, salts, buffers, stabilizers, solubilizers, and other materials 
which are well known in the art. 

The invention also involves expression vectors coding for SOC/CRAC proteins and 
fragments and variants thereof and host cells containing those expression vectors. Virtually 
any cells, prokeuyotic or eukaryotic, which can be transformed with heterologous DNA or 
RNA and which can be grown or maintained in culture, may be used in the practice of the 
invention. Examples include bacterial cells such as E.coli and eukaryotic cells such as mouse, 
hamster, pig, goat, primate, yeast, xenopous, etc. They may be of a wide variety of tissue 
types, including mast cells, fibroblasts, oocytes and lymphocytes, and they may be primary 
cells or cell lines. Specific examples include CHO cells and COS cells. Cell-free 
transcription systems also may be used in lieu of cells. 

As used herein, a "vector" may be any of a nimiber of nucleic acids into which a 
desired sequence may be inserted by restriction and ligation for transport between different 
genetic environments or for expression in a host cell. Vectors are typically composed of 
DNA although RNA vectors are also available. Vectors include, but are not limited to. 
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plasmids, phagemids and virus genomes. A cloning vector is one which is able to replicate in 
a host cell, and which is further characterized by one or more endonuclease restriction sites at 
which the vector may be cut in a determinable fashion and into which a desired DNA 
sequence may be ligated such that the new recombinant vector retains its ability to replicate in 
the host cell. In the case of plasmids, replication of the desired sequence may occur many 
times as the plasmid increases in copy number within the host bacterium or just a single time 
per host before the host reproduces by mitosis. In the case of phage, replication may occur 
actively during a lytic phase or passively during a lysogenic phase. An expression vector is 
one into which a desired DNA sequence may be inserted by restriction and ligation such that 
it is operably joined to regulatory sequences and may be expressed £is an RNA transcript. 
Vectors may further contain one or more marker sequences suitable for use in the 
identification of cells which have or have not been transformed or transfected vdth the vector. 
Markers include, for example, genes encoding proteins which increase or decrease either 
resistance or sensitivity to antibiotics or other compounds, genes which encode enzymes 
whose activities are detectable by standard assays known in the art (e.g., P-galactosidase or 
alkaline phosphatase), and genes which visibly ajSect the phenotype of transformed or 
transfected cells, hosts, colonies or plaques (e.g., green fluorescent protein). Preferred vectors 
are those capable of autonomous replication and expression of the structural gene products 
present in the DNA segments to which they are operably joined. 

As used herein, a coding sequence and regulatory sequences are said to be "operably" 
joined when they are covalently linked in such a way as to place the expression or 
transcription of the coding sequence under the influence or control of the regulatory 
sequences. If it is desired that the coding sequences be translated into a functional protein, 
two DNA sequences are said to be operably joined if induction of a promoter in the 5' 
regulatory sequences results in the transcription of the coding sequence and if the nature of 
the linkage between the two DNA sequences does not (1) result in the introduction of a frame- 
shift mutation, (2) interfere with the ability of the promoter region to direct the transcription 
of the coding sequences, or (3) interfere with the ability of the corresponding RNA transcript 
to be translated into a protein. Thus, a promoter region would be operably joined to a coding 
sequence if the promoter region were capable of effecting transcription of that DNA sequence 
such that the resulting transcript might be translated into the desired protein or polypeptide. 

The precise nature of the regulatory sequences needed for gene expression may vary 
between species or cell types, but shall in general include, as necessary, 5' non-transcribed 
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and 5' non*translated sequences involved with the initiation of transcription and translation 
respectively, such as a TATA box, capping sequence, CAAT sequence, and the like. 
Especially, such 5' non-transcribed regulatory sequences will include a promoter region 
which includes a promoter sequence for transcriptional control of the operably joined gene. 
Regulatory sequences may also include enhancer sequences or upstream activator sequences 
as desired. The vectors of the invention may optionally include 5' leader or signal sequences. 
The choice and design of an appropriate vector is within the ability and discretion of one of 
ordinary skill in the art. 

According to yet another aspect of the invention, isolated SOC/CRAC polypeptides 
are provided. Preferably, the isolated SOC/CRAC polypeptides are encoded by the isolated 
SOC/CRAC nucleic acid molecules disclosed herein. More preferably, the isolated 
SOC/CRAC polypeptides of the invention are encoded by the nucleic acid molecules having 
SEQ ID Nos. 1, 3, 5, 7, 23, 25, 27, 29, and 31. In yet other embodiments, the isolated 
SOC/CRAC polypeptides of the invention have an amino acid sequence selected from the 
group consisting of SEQ ID Nos. 2, 4, 6, 8, 24, 26, 28, 30 and 32. Preferably, the isolated 
SOC/CRAC polypeptides are of sufficient length to represent a sequence unique within the 
human genome. Thus, the preferred embodiments include a sequence of contiguous amino 
acids which is not identical to a prior art sequence as represented by the sequence group 
consisting of the contiguous amino acids identified in Table II (SEQ ID NO: 10, SEQ ID 
NO: 13, SEQ ID NO: 1 4, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19 and GenBank Acc. 
Nos. AB001535, AA592910, D86107, AF071787, Z77132, Z83117, Z68333, AA708532, 
AA551759, AA932133, R47363, N31660, NP003298, CAB00861, NP002411, CAA92726, 
CAB05572). 

In certain embodiments, the isolated SOC/CRAC polypeptides are immunogenic and 
can be used to generate binding polypeptides (e.g., antibodies) for use in diagnostic and 
therapeutic applications. Such binding polypeptides also are useful for detecting the presence, 
absence, and/or amounts of a SOC/CRAC nucleic acid or polypeptide in a sample such as a 
biological fluid or biopsy sample. Preferably, the SOC/CRAC polypeptides that are useful for 
generating binding polypeptides are xmique polypeptides and, therefore, binding of the 
antibody to a SOC/CRAC polypeptide in a sample is selective for the SOC/CRAC 
polypeptide. 

Expression vectors containing all the necessary elements for expression are 
commercially available and known to those skilled in the art. See, e.g., Sambrook et al.. 
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Molecular Cloning: A Laboratory Manual^ Second Edition, Cold Spring Harbor Laboratory 
Press, 1989. Cells are genetically engineered by the introduction into the cells of 
heterologous DNA (RNA) encoding a SOC/CRAC polypeptide or fragment or variant thereof. 
The heterologous DNA (RNA) is placed under operable control of transcriptional elements to 
permit the expression of the heterologous DNA in the host cell. 

Preferred systems for mRNA expression in manunalian cells are those such as 
pRc/CMV (available from Invitrogen, Carlsbad, CA) that contain a selectable marker such as 
a gene that confers G418 resistance (which facilitates the selection of stably transfected cell 
lines) and the human cytomegalovirus (CMV) enhancer-promoter sequences. Additionally, 
suitable for expression in primate or canine cell lines is the pCEP4 vector (Invitrogen, 
Carlsbad, CA), which contains an Epstein Barr virus (EBV) origin of replication, facilitating 
the maintenance of plasmid as a multicopy extrachromosomal element. Another expression 
vector is the pEF-BOS plasmid containing the promoter of polypeptide Elongation Factor la, 
which stimulates efficiently transcription in vitro. The plasmid is described by Mishizuma 
and Nagata {Nuc. Acids Res. 18:5322, 1990), and its use in transfection experiments is 
disclosed by, for example, Demoulin (Mol Cell Biol 16:4710-4716, 1996). Still another 
preferred expression vector is an adenovirus, described by Stratford-Perricaudet, which is 
defective for El and E3 proteins {J, Clin. Invest. 90:626-630, 1992). The use of the 
adenovirus as an Adeno.PlA recombinant is disclosed by Wamier et al., in intradermal 
injection in mice for immunization against PIA (Int. J. Cancer^ 67:303-310, 1996). 

The invention also embraces so-called expression kits, which allow the artisan to 
prepare a desired expression vector or vectors. Such expression kits include at least separate 
portions of each of the previously discussed coding sequences. Other components may be 
added, as desired, as long as the previously mentioned sequences, which are required, are 
included. 

It will also be recognized that the invention embraces the use of the above described, 
SOC/CRAC cDNA sequence containing expression vectors, to transfect host cells and cell 
lines, by these prokaryotic (e.g., E. coli l or eukaryotic (e.g., CHO cells, COS cells, yeast 
expression systems and recombinant baculovirus expression in insect cells). Especially useful 
are mammalian cells such as mouse, hamster, pig, goat, primate, etc. They may be of a v^de 
variety of tissue types, and include primary cells and cell lines. Specific examples include 
dendritic cells, U293 cells, peripheral blood leukocytes, bone marrow stem cells and 
embryonic stem cells. The invention also permits the construction of SOC/CRAC gene 
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"knock-outs'" in cells and in animals, providing materials for studying certain aspects of 
SOC/CRAC calcium channel activity. 

The invention also provides isolated polypeptides (including whole proteins and 
partial proteins), encoded by the foregoing SOC/CRAC nucleic acids, and include the 
polypeptides of SEQ ID NO:2, 4, 6, 8, 24, 26, 28, 30, 32, and unique fragments thereof .Such 
polypeptides are useful, for example, to regulate calcium transport-mediated cell growth, 
differentiation and proliferation, to generate antibodies, as components of immunoassays, etc. 
Polypeptides can be isolated from biological samples including tissue or cell homogenates, 
and can also be expressed recombinantly in a variety of prokaryotic and eukaryotic expression 
systems by constructing an expression vector appropriate to the expression system, 
introducing the expression vector into the expression system, and isolating the recombinantly 
expressed protein. Short polypeptides, including antigenic peptides (such as are presented by 
MHC molecules on the surface of a cell for immune recognition) also can be synthesized 
chemically using well-established methods of peptide synthesis. 

A unique fragment of a SOC/CRAC polypeptide, in general, has the features and 
characteristics of unique fragments as discussed above in connection with nucleic acids. As 
will be recognized by those skilled in the art, the size of the unique fragment will depend 
upon factors such as whether the fragment constitutes a portion of a conserved protein 
domain. Thus, some regions of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, 
SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, and/or SEQ ID NO:32, will 
require longer segments to be unique while others will require only short segments, typically 
between 5 and 12 amino acids (e.g. 5, 6, 7, 8, 9, 10, 11 and 12 amino acids long or more, 
including each integer up to the full length, > 1,000 amino acids long). Virtually any segment 
of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:24, SEQ ID 
NO:26, SEQ ID NO:28, SEQ ID NO:30, and/or SEQ ID NO:32, excluding the ones that share 
identity with it (the polypeptides identified in Table II - SEQ ID NO: 10, SEQ ID NO: 13, SEQ 
ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, and GenBank Acc. Nos. 
AB001535, AA592910, D86107, AF071787, Z77132, Z83117, Z68333, AA708532, 
AA551759, AA932133, R47363, N31660, NP003298, CAB00861, NP002411, CAA92726, 
CAB05572) that is 9 or more amino acids in length will be unique. 

Unique fragments of a polypeptide preferably are those fragments which retain a 
distinct functional capability of the polypeptide. Functional capabilities which can be retained 
in a imique fragment of a polypeptide include Ca^"*" fluxing, high selectivity, a unitary 
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conductance below the detection level of the patch clamp method, and/or and are subject to 
inhibition by high intracellular calcium levels. 

One important aspect of a unique fragment is its ability to act as a signature for 
identifying the polypeptide. Optionally, another aspect of a unique fragment is its ability to 
provide an immune response in an animal. Those skilled in the art are well versed in methods 
for selecting unique amino acid sequences, typically on the basis of the ability of the imique 
fragment to selectively distinguish the sequence of interest from non-family members. A 
comparison of the sequence of the fragment to those on known databases typically is all that 
is necessary. 

The invention embraces variants of the SOC/CRAC polypeptides described above. As 
used herein, a "variant" of a SOC/CRAC polypeptide is a polypeptide which contains one or 
more modifications to the primziry amino acid sequence of a SOC/CRAC polypeptide. 
Modifications which create a SOC/CRAC polypeptide variant are typically made to the 
nucleic acid which encodes the SOC/CRAC polypeptide, and can include deletions, point 
mutations, truncations, amino acid substitutions and addition of amino acids or non-amino 
acid moieties to: 1) reduce or eliminate a calcium channel activity of a SOC/CRAC 
polypeptide; 2) enhance a property of a SOC/CRAC polypeptide, such as protein stability in 
an expression system or the stability of protein-protein binding; 3) provide a novel activity or 
property to a SOC/CRAC polypeptide, such as addition of an antigenic epitope or addition of 
a detectable moiety; or 4) to provide equivalent or better binding to a SOC/CRAC polypeptide 
receptor or other molecule. Alternatively, modifications can be made directly to the 
polypeptide, such as by cleavage, addition of a linker molecule, addition of a detectable 
moiety, such as biotin, addition of a fatty acid, and the like. Modifications also embrace 
fusion proteins comprising all or part of the SOC/CRAC amino acid sequence. One of skill in 
the art will be familiar with methods for predicting the effect on protein conformation of a 
change in protein sequence, and can thus "design" a variant SOC/CRAC polypeptide 
according to known methods. One example of such a method is described by Dahiyat and 
Mayo in Science 278:82-87, 1997, whereby proteins can be designed de novo. The method 
can be applied to a known protein to vary only a portion of the polypeptide sequence. By 
applying the computational methods of Dahiyat and Mayo, specific variants of a SOC/CRAC 
calcium channel polypeptide can be proposed and tested to determine whether the variant 
retains a desired conformation. 
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Variants can include SOC/CRAC polypeptides which are modified specifically to alter 
a feature of the polypeptide unrelated to its physiological activity. For example, cysteine 
residues can be substituted or deleted to prevent unwanted disulfide linkages. Similarly, 
certain amino acids can be changed to enhance expression of a SOC/CRAC polypeptide by 
eliminating proteolysis by proteases in an expression system (e.g., dibasic amino acid residues 
in yeast expression systems in which KEX2 protease activity is present). 

Mutations of a nucleic acid which encodes a SOC/CRAC polypeptide preferably 
preserve the amino acid reading frame of the coding sequence and, preferably, do not create 
regions in the nucleic acid which are likely to hybridize to form secondary structures, such as 
hairpins or loops, which can be deleterious to expression of the variant polypeptide. 

Mutations can be made by selecting an amino acid substitution, or by random 
mutagenesis of a selected site in a nucleic acid which encodes the polypeptide. Variant 
polypeptides are then expressed and tested for one or more activities to determine which 
mutation provides a variant polypeptide with the desired properties. Further mutations can be 
made to variants (or to non-variant SOC/CRAC polypeptides) which are silent as to the amino 
acid sequence of the polypeptide, but wdiich provide preferred codons for translation in a 
particular host. The preferred codons for tremslation of a nucleic acid in, e.g., E, coli . are well 
known to those of ordinary skill in the art. Still other mutations can be made to the noncoding 
sequences of a SOC/CRAC gene or cDNA clone to enhance expression of the polypeptide. 

The skilled artisan will realize that conservative amino acid substitutions may be made 
in SOC/CRAC polypeptides to provide functionally equivalent variants of the foregoing 
polypeptides, i.e, the variants retain the functional capabilities of the SOC/CRAC 
polypeptides. As used herein, a "conservative amino acid substitution" refers to an amino 
acid substitution which does not alter the relative charge or size characteristics of the protein 
in which the amino acid substitution is made. Variants can be prepared according to methods 
for altering polypeptide sequence known to one of ordinary skill in the art such as are found in 
references which compile such methods, e.g. Molecular Cloning: A Laboratory Manual^ J. 
Sambrook, et al., eds.. Second Edition, Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, New York, 1989, or Current Protocols in Molecular Biology^ F.M. Ausubel, et al., 
eds., John Wiley & Sons, Inc., New York. Exemplary functionally equivalent variants of the 
SOC/CRAC polypeptides include conservative amino acid substitutions of SEQ ID NO:2, 
SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:24, SEQ ID NO:26, SEQ ID 
NO:28, SEQ ID NO:30, and/or SEQ ID NO:32. Conservative substitutions of amino acids 
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include substitutions made amongst amino acids within the following groups: (a) M, I, L, V; 
(b) F, Y, W; (c) K, R, H; (d) A, G; (e) S, T; (f) Q, N; and (g) E, D. 

Thus fiinctionally equivalent variants of SOC/CRAC polypeptides, i.e., variants of 
SOC/CRAC polypeptides which retain the function of the natural SOC/CRAC polypeptides, 
are contemplated by the invention. Conservative amino-acid substitutions in the amino acid 
sequence of SOC/CRAC polypeptides to produce functionally equivalent variants of 
SOC/CRAC polypeptides typically are made by alteration of a nucleic acid encoding 
SOC/CRAC polypeptides (e.g., SEQ ID NOs:l, 3, 5, 7, 23, 25, 27, 29, 31). Such substitutions 
can be made by a variety of methods known to one of ordinary skill in the art. For example, 
amino acid substitutions may be made by PCR-directed mutation, site-directed mutagenesis 
according to the method of Kunkel (Kunkel, Proc. Nat Acad ScL U.S.A. 82: 488-492, 1985), 
or by chemical synthesis of a gene encoding a SOC/CRAC polypeptide. The activity of 
functionally equivalent fragments of SOC/CRAC polypeptides can be tested by cloning the 
gene encoding the altered SOC/CRAC polypeptide into a bacterial or mammalian expression 
vector, introducing the vector into an appropriate host cell, expressing the altered SOC/CRAC 
polypeptide, and testing for a functional capability of the SOC/CRAC polypeptides as 
disclosed herein (e.g., SOC/CRAC calcium channel activity). 

The invention as described herein has a number of uses, some of which are described 
elsewhere herein. First, the invention permits isolation of SOC/CRAC polypeptides, 
including the isolation of the complete SOC/CRAC polypeptide. A variety of methodologies 
well-known to the skilled practitioner can be utilized to obtain isolated SOC/CRAC 
molecules. The polypeptide may be purified from cells which naturally produce the 
polypeptide by chromatographic means or immunological recognition. Alternatively, an 
expression vector may be introduced into cells to cause production of the polypeptide. In 
another method, mRNA transcripts may be microinjected or otherwise introduced into cells to 
cause production of the encoded polypeptide. Translation of SOC/CRAC mRNA in cell-free 
extracts such as the reticulocyte lysate system also may be used to produce SOC/CRAC 
polypeptides. Those skilled in the art also can readily follow known methods for isolating 
SOC/CRAC polypeptides. These include, but are not limited to, immunochromatography, 
HPLC, size-exclusion chromatography, ion-exchange chromatography and immune-affinity 
chromatography. 

The invention also provides, in certain embodiments, "dominant negative" 
polypeptides derived from SOC/CRAC polypeptides. A dominant negative polypeptide is an 
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inactive variant of a protein, which, by interacting with the cellular machinery, displaces an 
active protein from its interaction with the cellular machinery or competes with the active 
protein, thereby reducing the effect of the active protein. For example, a dominant negative 
receptor which binds a ligand but does not transmit a signal in response to binding of the 
ligand can reduce the biological effect of expression of the ligand. Likewise, a dominant 
negative inactive SOC/CRAC calcium channel which interacts normally with the cell 
membrane but which does not mediate calcium transport can reduce calcium transport in a 
cell. Similarly, a dominant negative transcription factor which binds to a promoter site in the 
control region of a gene but does not increase gene transcription can reduce the effect of a 
normal transcription factor by occupying promoter binding sites without increasing 
transcription. 

The end result of the expression of a dominant negative polypeptide in a cell is a 
reduction in function of active proteins. One of ordinary skill in the art can assess the 
potential for a dominant negative variant of a protein, and using standard mutagenesis 
techniques to create one or more dominant negative variant polypeptides. See, e.g., U.S. 
Patent No. 5,580,723 and Sambrook et al., 1989. Molecular Cloning: A Laboratory Manual^ 
Second Edition, Cold Spring Harbor Laboratory Press. The skilled artisan then can test the 
population of mutagenized polypeptides for diminution in a selected and/or for retention of 
such an activity. Other similar methods for creating and testing dominant negative variants of 
a protein will be apparent to one of ordinary skill in the art. 

According to another aspect, the invention provides a method for isolating a 
SOC/CRAC molecule having SOC/CRAC calcium channel activity. The method involves 
contacting a binding molecule that is a SOC/CRAC nucleic acid or a SOC/CRAC binding 
polypeptide with a sample containing one or more SOC/CRAC molecules under conditions 
that allow such binding (see earlier discussion) to form a complex, detecting the presence of 
the complex, isolating the SOC/CRAC molecule from the complex, and determining whether 
the isolated SOC/CRAC molecule has SOC/CRAC calcium channel activity. Thus, the 
invention is useful for identifying and isolating full length complementary (cDNA) or 
genomic nucleic acids encoding SOC/CRAC polypeptides having SOC/CRAC calcium 
channel activity. Identification and isolation of such nucleic acids and polypeptides may be 
accomplished by hybridizing/binding, under appropriate conditions well known in the art, 
libraries and/or restriction enzyme-digested human nucleic acids, with a labeled SOC/CRAC 
molecular probe. As used herein, a "label" includes molecules that are incorporated into, for 
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example, a SOC/CRAC molecule (nucleic acid or peptide), that can be directly or indirectly 
detected. A wide variety of detectable labels are well known in the art that can be used, and 
include labels that provide direct detection (e.g., radioactivity, luminescence, optical or 
electron density, etc), or indirect detection (e.g., epitope tag such as the FLAG epitope, 
enzyme tag such as horseseradish peroxidase, etc.)- The label may be bound to a SOC/CRAC 
binding partner, or incorporated into the structure of the binding partner, 

A variety of methods may be used to detect the label, depending on the nature of the 
label and other assay components. For example, the label may be detected while bound to the 
solid substrate or subsequent to separation from the solid substrate. Labels may be directly 
detected through optical or electron density, radioactive emissions, nonradioactive energy 
transfers, etc. or indirectly detected with antibody conjugates, strepavidin-biotin conjugates, 
etc. Methods for detecting the labels are well known in the art. Once a library clone or 
hybridizing fragment is identified in the hybridization/binding reaction, it can be further 
isolated by employing standard isolation/cloning techniques known to those of skill in the art. 
See, generally, Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual^ 2nd 
Edition, Cold Spring Harbor Laboratory Press. In addition, nucleic acid amplification 
techniques well known in the art, may also be used to locate splice variants of calcium 
channel (or calcium channel subunits) with SOC/CRAC calcium channel activity. Size and 
sequence determinations of the amplification products can reveal splice variants. 

The foregoing isolated nucleic acids and polypeptides may then be conipared to the 
nucleic acids and polypeptides of the present invention in order to identify homogeneity or 
divergence of the sequences, and be further characterized functionally to determine whether 
they belong to a family of molecules with SOC/CRAC calcium chaimel activity (for 
methodology see under the Examples section). 

The isolation of the SOC/CRAC cDNA and/or partial sequences thereof also makes it 
possible for the artisan to diagnose a disorder characterized by an aberrant expression of 
SOC/CRAC. These methods involve determining expression of the SOC/CRAC gene, and/or 
SOC/CRAC polypeptides derived therefrom. In the former situation, such determinations can 
be carried out via any standard nucleic acid determination assay, including the polymerase 
chain reaction, or assaying with labeled hybridization probes as exemplified below. In the 
latter situation, such determination can be carried out via any standard immimological assay 
using, for example, antibodies which bind to the SOC/CRAC protein. 
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The invention also embraces isolated peptide binding agents which, for example, can 
be antibodies or fragments of antibodies ("binding polypeptides"), having the ability to 
selectively bind to SOC/CRAC polypeptides. Antibodies include polyclonal and monoclonal 
antibodies, prepared according to conventional methodology. In certain embodiments, the 
invention excludes binding agents (e.g., antibodies) that bind to the polypeptides encoded by 
the nucleic acids of SEQ ID NOs: 10, 12, 13, 14, 15, 17, and 19. 

Significantly, as is well-known in the art, only a small portion of an antibody 
molecule, the paratope, is involved in the binding of the antibody to its epitope (see, in 
general, Clark, W.R. (1986) The Experimental Foundations of Modem Immunology Wiley & 
Sons, Inc., New York; Roitt, 1. (1991) Essential Immunology . 7th Ed., Blackwell Scientific 
Publications, Oxford). The pFc' and Fc regions, for example, are effectors of the complement 
cascade but are not involved in antigen binding. An antibody from which the pFc' region has 
been enzymatically cleaved, or which has been produced without the pFc' region, designated 
an F(ab')2 fragment, retains both of the antigen binding sites of an intact antibody. Similarly, 
an antibody from which the Fc region has been enzymatically cleaved, or which has been 
produced v^dthout the Fc region, designated an Fab fi^gment, retains one of the antigen 
binding sites of an intact antibody molecule. Proceeding fiirther. Fab fragments consist of a 
covalently bound antibody light chain and a portion of the antibody heavy chain denoted Fd. 
The Fd fragments are the major determinant of antibody specificity (a single Fd fragment may 
be associated v^th up to ten different light chains without altering antibody specificity) and Fd 
fragments retain epitope-binding ability in isolation. 

Within the antigen-binding portion of an antibody, as is well-known in the art, there 
are complementarity determining regions (CDRs), which directly interact with the epitope of 
the antigen, and framework regions (FRs), which maintain the tertiary structure of the 
paratope (see, in general, Clark, 1986: Roitt, 1991). In both the heavy chain Fd fragment and 
the light chain of IgG immunoglobulins, there are four framework regions (FRl through FR4) 
separated respectively by three complementarity determining regions (CDRl through CDR3). 
The CDRs, and in particular the CDR3 regions, and more particularly the heavy chain CDR3, 
are largely responsible for antibody specificity. 

It is now well-established in the art that the non-CDR regions of a mammalian 
antibody may be replaced with similar regions of conspecific or heterospecific antibodies 
while retaining the epitopic specificity of the original antibody. This is most clearly 
manifested in the development and use of "humanized" antibodies in which non-human CDRs 
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are covalently joined to human FR and/or Fc/pFc' regions to produce a functional antibody. 
Thus, for example, PCT International Publication Number WO 92/04381 teaches the 
production and use of humanized murine RSV antibodies in which at least a portion of the 
murine FR regions have been replaced by FR regions of human origin. Such antibodies, 
including fragments of intact antibodies with antigen-binding ability, are often referred to as 
"chimeric" antibodies. 

Thus, as will be apparent to one of ordinary skill in the art, the present invention also 
provides for F(ab')2, Fab, Fv and Fd fragments; chimeric antibodies in which the Fc and/or 
FR and/or CDRl and/or CDR2 and/or light chain CDR3 regions have been replaced by 
homologous human or non-human sequences; chimeric F(ab')2 fragment antibodies in which 
the FR and/or CDRl and/or CDR2 and/or light chain CDR3 regions have been replaced by 
homologous human or non-human sequences; chimeric Fab fragment antibodies in which the 
FR and/or CDRl and/or CDR2 and/or light chain CDR3 regions have been replaced by 
homologous hiunan or non-human sequences; and chimeric Fd fragment antibodies in which 
the FR and/or CDRl and/or CDR2 regions have been replaced by homologous human or non- 
human sequences. The present invention also includes so-called single chain antibodies. 

Thus, the invention involves binding polypeptides of numerous size and type that bind 
selectively to SOC/CRAC polypeptides, and complexes containing SOC/CRAC polypeptides. 
These binding polypeptides also may be derived also from sources other than antibody 
technology. For example, such polypeptide binding agents can be provided by degenerate 
peptide libraries which can be readily prepared in solution, in immobilized form, as bacterial 
flagella peptide display libraries or as phage display libraries. Combinatorial libraries also 
can be synthesized of peptides containing one or more amino acids. Libraries further can be 
synthesized of peptides and non-peptide synthetic moieties. 

Phage display can be particularly effective in identifying binding peptides useful 
according to the invention. Briefly, one prepares a phage library (using e.g. ml 3, fd, or 
lambda phage), displaying inserts from 4 to about 80 amino acid residues using conventional 
procedures. The inserts may represent, for example, a completely degenerate or biased array. 
One then can select phage-bearing inserts which bind to the SOC/CRAC polypeptide or a 
complex containing a SOC/CRAC polypeptide. This process can be repeated through several 
cycles of reselection of phage that bind to the SOC/CRAC polypeptide or complex. Repeated 
roimds lead to enrichment of phage bearing particular sequences. DNA sequence analysis can 
be conducted to identify the sequences of the expressed polypeptides. The minimal linear 
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poition of the sequence that binds to the SOC/CRAC polypeptide or complex can be 
detennined. One can repeat the procedure using a biased library containing inserts containing 
part or all of the minimal linear portion plus one or more additional degenerate residues 
upstream or downstream thereof. Yeast two-hybrid screening methods also may be used to 
identify polypeptides that bind to the SOC/CRAC polypeptides. Thus, the SOC/CRAC 
polypeptides of the invention, or a fragment thereof, or complexes of SOC/CRAC can be used 
to screen peptide libraries, including phage display libraries, to identify and select peptide 
binding polypeptides that selectively bind to the SOC/CRAC polypeptides of the invention. 
Such molecules can be used, as described, for screening assays, for purification protocols, for 
interfering directly with the functioning of SOC/CRAC and for other purposes that will be 
apparent to those of ordinary skill in the art. 

A SOC/CRAC polypeptide, or a fragment thereof, also can be used to isolate naturally 
occurring, polypeptide binding partners which may associate with the SOC/CRAC 
polypeptide in the membrane of a cell. Isolation of binding partners may be performed 
according to well-known methods. For example, isolated SOC/CRAC polypeptides can be 
attached to a substrate, and then a solution suspected of containing an SOC/CRAC binding 
partner may be applied to the substrate. If the binding partner for SOC/CRAC polypeptides is 
present in the solution, then it will bind to the substrate-bound SOC/CRAC polypeptide. The 
binding partner then may be isolated. Other proteins which are binding partners for 
SOC/CRAC, may be isolated by similar methods without undue experimentation. 

The invention also provides novel kits which could be used to measure the levels of 
the nucleic acids of the invention, expression products of the invention or anti-SOC/CRAC 
antibodies. In the case of nucleic acid detection, pairs of primers for amplifying SOC/CRAC 
nucleic acids can be included. The preferred kits would include controls such as known 
amounts of nucleic acid probes, SOC/CRAC epitopes (such as SOC/CRAC expression 
products) or anti-SOC/CRAC antibodies, as well as instructions or other printed material. In 
certain embodiments the printed material can characterize risk of developing a disorder that is 
characterized by aberrant SOC/CRAC polypeptide expression based upon the outcome of the 
assay. The reagents may be packaged in containers and/or coated on wells in predetermined 
amounts, and the kits may include standard materials such as labeled immunological reagents 
(such as labeled anti-IgO antibodies) and the like. One kit is a packaged polystyrene 
microliter plate coated with a SOC/CRAC polypeptide and a container containing labeled 
anti-human IgG antibodies. A well of the plate is contacted with, for example, serum, washed 
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and then contacted with the anti-IgG antibody. The label is then detected. A kit embodying 
features of the present invention is comprised of the following major elements: packaging an 
agent of the invention, a control agent, and instructions. Packaging is a box-like structure for 
holding a vial (or number of vials) containing an agent of the invention, a vial (or number of 
vials) containing a control agent, and instructions, hidividuals skilled in the art can readily 
modify packaging to suit individual needs. 

Another aspect of the invention is a method for determining the level of SOC/CRAC 
expression in a subject. As used herein, a subject is a human, non-human primate, cow, 
horse, pig, sheep, goat, dog, cat or rodent. In all embodiments, human subjects are preferred. 
Expression is defined either as SOC/CRAC mRNA expression or SOC/CRAC polypeptide 
expression. Various methods can be used to measure expression. Preferred embodiments of 
the invention include PCR and Northern blotting for measuring mRNA expression, and 
monoclonal or polyclonal SOC/CRAC antisera as reagents to measure SOC/CRAC 
polypeptide expression. In certain embodiments, test samples such as biopsy samples, and 
biological fluids such as blood, are used as test samples. SOC/CRAC expression in a test 
sample of a subject is compared to SOC/CRAC expression in control sample to, e.g., assess 
the presence or absence or stage of a proliferative disorder (e.g., a lymphocyte proliferative 
disorder) in a subject. 

SOC/CRAC polypeptides preferably are produced recombinantly, although such 
polypeptides may be isolated from biological extracts. Recombinantly produced SOC/CRAC 
polypeptides include chimeric proteins comprising a fusion of a SOC/CRAC protein vsdth 
another polypeptide, e.g., a polypeptide capable of providing or enhancing protein-protein 
binding, sequence specific nucleic acid binding (such as GAL4), enhancing stability of the 
SOC/CRAC polypeptide imder assay conditions, or providing a detectable moiety, such as 
green fluorescent protein. A polypeptide fused to a SOC/CRAC polypeptide or fragment may 
also provide means of readily detecting the fusion protein, e.g., by immunological recognition 
or by fluorescent labeling. 

The invention is also useful in the generation of transgenic non-human animals. As 
used herein, "transgenic non-human animals" includes non-human animals having one or 
more exogenous nucleic acid molecules incorporated in germ line cells and/or somatic cells. 
Thus the transgenic animal include "knockout" animals having a homozygous or 
heterozygous gene disruption by homologous recombination, animals having episomal or 
chromosomally incorporated expression vectors, etc. Knockout animals can be prepared by 
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homologous recombination using embryonic stem cells as is well known in the art. The 
recombination may be facilitated using, for example, the cre/lox system or other recombinase 
systems known to one of ordinary skill in the art. In certain embodiments, the recombinase 
system itself is expressed conditionally, for example, in certain tissues or cell types, at certain 
embryonic or post-embryonic developmental stages, inducibly by the addition of a compound 
which increases or decreases expression, and the like. In genered, the conditional expression 
vectors used in such systems use a variety of promoters which confer the desired gene 
expression pattern (e.g., temporal or spatial). Conditional promoters also can be operably 
linked to SOC/CRAC nucleic acid molecules to increase expression of SOC/CRAC in a 
regulated or conditional manner. Trans-acting negative regulators of SOC/CRAC calciimi 
channel activity or expression also can be operably linked to a conditional promoter as 
described above. Such /rflrns-acting regulators include antisense SOC/CRAC nucleic acids 
molecules, nucleic acid molecules which encode dominant negative SOC/CRAC molecules, 
ribozyme molecules specific for SOC/CRAC nucleic acids, and the like. The transgenic non- 
human animals are useful in experiments directed toward testing biochemical or physiological 
effects of diagnostics or therapeutics for conditions characterized by increased or decreased 
SOC/CRAC expression. Other uses wdll be apparent to one of ordinary skill in the art. 

The invention further provides efficient methods of identifying agents or lead 
compounds for agents active at the level of a SOC/CRAC polypeptide (e.g., a SOC/CRAC 
polypeptide) or SOC/CRAC fragment dependent cellular function. In particular, such 
functions include interaction with other polypeptides or fragments thereof, and selective 
binding to certain molecules (e.g., agonists and antagonists). Generally, the screening 
methods involve assaying for compounds which interfere with SOC/CRAC calcium channel 
activity, although compoimds which enhance SOC/CRAC calcium channel activity also can 
be assayed using the screening methods. Such methods are adaptable to automated, high 
throughput screening of compounds. The target therapeutic indications for pharmacological 
agents detected by the screening methods are limited only in that the target cellular function 
be subject to modulation by alteration of the formation of a complex comprising a 
SOC/CRAC polypeptide or fragment thereof and one or more SOC/CRAC binding targets. 
Target indications include cellular processes modulated by SOC/CRAC such as Ca^^ fluxing, 
and affected by SOC/CRAC ability to form complexes with other molecules and polypeptides 
as, for example, may be present in the cell membrane. 
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A wide variety of assays for pharmacological agents are provided^ including, 
expression assays, labeled in vitro protein-protein binding assays, electrophoretic mobility 
shift assays, immunoassays, cell-based assays such as calcium transport assays, etc. For 
example, two-hybrid screens are used to rapidly examine the effect of transfected nucleic 
acids on the intracellular binding of SOC/CRAC or SOC/CRAC fragments to specific 
intracellular targets (e.g. a tyrosine kinase). The transfected nucleic acids can encode, for 
example, combinatorial peptide libraries or cDNA libraries. Convenient reagents for such 
assays, e.g., GAL4 fusion proteins, are known in the art. An exemplary cell-based assay 
involves transfecting a cell with a nucleic acid encoding a SOC/CRAC polypeptide fused to a 
GAL4 DNA binding domain and a nucleic acid encoding a reporter gene operably linked to a 
gene expression regulatory region, such as one or more GAL4 binding sites. Activation of 
reporter gene transcription occurs when the SOC/CRAC and reporter fusion polypeptides bind 
such as to enable transcription of the reporter gene. Agents which modulate a SOC/CRAC 
polypeptide mediated cell function are then detected through a change in the expression of 
reporter gene. Methods for determining changes in the expression of a reporter gene are 
known in the art. 

In an expression system, for example, a SOC/CRAC polypeptide is attached to a 
membrane, the membrane preferably separating two fluid environments and being otherwise 
not permeable to Ca^*. Such separation is preferred so that a change in Ca^^ concentration on 
either side of the membrane is mediated only through the attached SOC/CRAC polypeptide. 
Preferably, a SOC/CRAC polypeptide is expressed in an intact cell and is present on the cell- 
membrane (as in physiologic conditions). The cell expressing the SOC/CRAC polypeptide is 
preferably a eukaryotic cell, and the SOC/CRAC polypeptide is preferably recombinantly 
expressed, although cells naturally expressing a SOC/CRAC polypeptide may also be used. 
Synthetic membranes, however, containing SOC/CRAC polypeptides may also be used. See, 
e.g., K. Kiselyov, et al.. Functional interaction between InsP3 receptors and store-operated 
Htrp3 channels. Nature 396. 478-82 (1998). 

The cell expressing the SOC/CRAC polypeptide is incubated under conditions which, 
in the absence of the candidate agent, permit calcium flux into the cell and allow detection of 
a reference calcium concentration. For example, depletion of intracellular calcium stores with 
thapsigargin or other agents (Putney, J.W. Jr., in Capacitative Calcium Entry , R.G. Landes 
Co. and Chapman & Hall, 1997) would produce a given level of SOC/CRAC chaimel 
activation and a given reference calcium concentration. Detection of a decrease in the 
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foregoing activities (i.e., a decrease in the intracellular calcium concentration) relative to the 
reference calcium concentration indicates that the candidate agent is a lead compound for an 
agent to inhibit SOC/CRAC calcium channel activity. Preferred SOC/CRAC polypeptides 
include the polypeptides of claim 15. 

SOC/CRAC fragments used in the methods, when not produced by a transfected 
nucleic acid are added to an assay mixture as an isolated polypeptide. SOC/CRAC 
polypeptides preferably are produced recombinantly, although such polypeptides may be 
isolated from biological extracts or chemically synthesized. Recombinantly produced 
SOC/CRAC polypeptides include chimeric proteins comprising a fiision of a SOC/CRAC 
protein with another polypeptide, e.g., a polypeptide capable of providing or enhancing 
protein-protein binding, sequence specific nucleic acid binding (such as GAL4), enhancing 
stability of the SOC/CRAC polypeptide under assay conditions, or providing a detectable 
moiety, such as green fluorescent protein or Flag epitope. 

The assay mixture is comprised of a SOC/CRAC polypeptide binding target 
(candidate agent) capable of interacting with a SOC/CRAC polypeptide. While natural 
SOC/CRAC binding targets may be used, it is frequently preferred to use portions (e.g., 
peptides or nucleic acid fragments) or analogs (i.e., agents which mimic the SOC/CRAC 
binding properties of the natural binding target for purposes of the assay) of the SOC/CRAC 
binding target so long as the portion or analog provides binding afSnity and avidity to the 
SOC/CRAC polypeptide (or fragment thereof) measurable in the assay. 

The assay mixture also comprises a candidate agent (binding target, e.g., 
agonist/antagonist). Typically, a plurality of assay mixtures are run in parallel with different 
agent concentrations to obtain a different response to the various concentrations. Typically, 
one of these concentrations serves as a negative control, i.e., at zero concentration of agent or 
at a concentration of agent below the limits of assay detection. Candidate agents encompass 
numerous chemical classes, although typically they are organic compounds. Preferably, the 
candidate agents are small organic compounds, i.e., those having a molecular weight of more 
than 50 yet less than about 2500, preferably less than about 1 000 and, more preferably, less 
than about 500. Candidate agents comprise functional chemical groups necessary for 
structural interactions with polypeptides and/or nucleic acids, and typically include at least an 
amine, carbonyl, hydroxyl or carboxyl group, preferably at least two of the frmctional 
chemical groups smd more preferably at least three of the frmctional chemical groups. The 
candidate agents can comprise cyclic carbon or heterocyclic structure and/or aromatic or 
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polyaromatic structures substituted with one or more of the above-identified functional 
groups. Candidate agents also can be biomolecules such as peptides, saccharides, fatty acids, 
sterols, isoprenoids, purines, pyrimidines, derivatives or structural analogs of the above, or 
combinations thereof and the like. Where the agent is a nucleic acid, the agent typically is a 
DNA or RNA molecule, although modified nucleic acids as defined herein are also 
contemplated. 

Candidate agents are obtained from a wide variety of soiu-ces including libraries of 
synthetic or natural compounds. For example, numerous means are available for random and 
directed synthesis of a wide variety of organic compounds and biomolecules, including 
expression of randomized oligonucleotides, synthetic organic combinatorial libraries, phage 
display libraries of random peptides, and the like. Alternatively, libraries of natural 
compounds in the form of bacterial, fungal, plant and animal extracts are available or readily 
produced. Additionally, natural and synthetically produced libraries and compounds can be 
readily modified through conventional chemical, physical, and biochemical means. Further, 
known agents may be subjected to directed or random chemical modifications such as 
acylation, alkylation, esterification, amidification, etc. to produce structural analogs of the 
agents. Non-SOC/CRAC calcium channel agonists and antagonists, for example, include 
agents such as dihydropyridines (DHPs), phenylalkylamines, omega conotoxin 
(omega.-CgTx) and pyrazonoylguanidines. 

A variety of other reagents also can be included in the mixture. These include 
reagents such as salts, buffers, neutral proteins (e.g., albumin), detergents, etc. which may be 
used to facilitate optimal protein-protein, protein-nucleic acid, and/or protein/membrane 
component binding association. Such a reagent may also reduce non-specific or background 
interactions of the reaction components. Other reagents that improve the efficiency of the 
assay such as protease, inhibitors, nuclease inhibitors, antimicrobial agents, and the like may 
also be used. 

The mixture of the foregoing assay materials is incubated under conditions whereby, 
but for the presence of the candidate agent, the SOC/CRAC polypeptide specifically binds the 
cellular binding target, a portion thereof or analog thereof. The order of addition of 
components, incubation temperature, time of incubation, and other perimeters of the assay 
may be readily determined. Such experimentation merely involves optimization of the assay 
parameters, not the fimdamental composition of the assay. Incubation temperatures typically 
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are between 4**C and 40*^0. Incubation times preferably are minimized to facilitate rapid, high 
throughput screening, and typically are between 0.1 and 10 hoxirs. 

After incubation, the presence or absence of specific binding between the SOC/CRAC 
polypeptide and one or more binding targets is detected by any convenient method available 
to the user. For cell free binding type assays, a separation step is often used to separate bound 
from unboimd components. The separation step may be accomplished in a variety of ways. 
Conveniently, at least one of the components is immobilized on a solid substrate, from which 
the unbound components may be easily separated. The solid substrate can be made of a wide 
variety of materials and in a wide variety of shapes, e.g., microtiter plate, microbead, dipstick, 
resin particle, etc. The substrate preferably is chosen to maximum signal to noise ratios, 
primarily to minimize background binding, as well as for ease of separation and cost. 

Separation may be effected for example, by removing a bead or dipstick from a 
reservoir, emptying or diluting a reservoir such as a microtiter plate well, rinsing a bead, 
particle, chromotograpic column or filter with a wash solution or solvent. The separation step 
preferably includes multiple rinses or washes. For example, when the solid substrate is a 
microtiter plate, the wells may be washed several times with a washing solution, which 
typically includes those components of the incubation mixture that do not participate in 
specific bindings such as salts, buffer, detergent, non-specific protein, etc. Where the solid 
substrate is a magnetic bead, the beads may be washed one or more times with a washing 
solution and isolated using a magnet. 

Detection may be effected in any convenient way for cell-based assays such as two- or 
three-hybrid screens. The transcript resulting from a reporter gene transcription assay of 
SOC/CRAC polypeptide interacting with a target molecule typically encodes a directly or 
indirectly detectable product, e.g., P-galactosidase activity, luciferase activity, and the like. 
For cell-free binding assays, one of the components usually comprises, or is coupled to, a 
detectable label. A wide variety of labels can be used, such as those that provide direct 
detection (e.g., radioactivity, luminescence, optical or electron density, etc.) or indirect 
detection (e.g., epitope tag such as the FLAG epitope, enzyme tag such as horseseradish 
peroxidase, etc.). The label may be bound to a SOC/CRAC binding partner, or incorporated 
into the structure of the binding partner. 

A variety of methods may be used to detect the label, depending on the nature of the 
label and other assay components. For example, the label may be detected while bound to the 
solid substrate or subsequent to separation from the solid substrate. Labels may be directly 
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detected through optical or electron density, radioactive emissions, nonradiative energy 
transfers, etc. or indirectly detected with antibody conjugates, strepavidin-biotin conjugates, 
etc. Methods for detecting the labels are well known in the art. 

Of particular importance in any of the foregoing assays and binding studies is the use 
of a specific sequence motif identified in the SOC-2/CRAC-1 polypeptide sequence as a 
kinase catalytic domain. According to the invention, amino acids 999-1180 of the SOC- 
2/CRAC-l polypeptide (SEQ ID NO:24) (or a fragment thereof), show a localized homology 
with the catalytic domains of eukaryotic elongation factor-2 kinase (eEF-2 kinase, GenBank 
Acc. no. U93850) and Dictyostelium myocin heavy chain kinase A (MHCK A, GenBank Acc. 
no. U16856), as disclosed in Ryazanov AG, et al., Proc Natl Acad Sci USA, 1997, 
94(10):4884-4889. Therefore, according to the invention, a method for identifying agents 
useful in the modulation of SOC/CRAC polypeptide kinase activity is provided. The method 
involves contacting a SOC/CRAC polypeptide with kinase activity, that includes, for 
example, amino acids 999- 11 80 of the SOC-2/CRAC-1 polypeptide (SEQ ID NO:24) with a 
candidate agent suspected of modulating SOC/CRAC kinase activity, under conditions 
sufBcient to allow the candidate agent to interact with the SOC/CRAC polypeptide and 
modulate its kinase activity; detecting a kinase activity associated with the SOC/CRAC 
polypeptide in the presence of the candidate agent; and comparing the kinase activity in the 
previous step with a control kinase activity of a SOC/CRAC polypeptide in the absence of the 
candidate agent to determine whether the candidate agent modulates (increases or decreases) 
SOC/CRAC kinase activity. Other controls for kinase activity can also be performed at the 
same time, for example, by utilizing eEF-2 kinase and/or Dictyostelium MHC Kinase A, in a 
similar manner to the SOC/CRAC member. Methods for performing such kinase activity 
assays are well known in the art. 

The invention thus provides SOC/CRAC-specific binding agents, methods of 
identifying and making such agents, and their use in diagnosis, therapy and pharmaceutical 
development. For example, SOC/CRAC-specific agents are usefiil in a variety of diagnostic 
and therapeutic applications, especially where disease or disease prognosis is associated with 
altered SOC/CRAC and SOC/CRAC calcium channel fluxing characteristics. Novel 
SOC/CRAC-specific binding agents include SOC/CRAC-specific antibodies and other natural 
intracellular and extracellular binding agents identified with assays such as two hybrid 
screens, and non-natural intracellular and extracellular binding agents identified in screens of 
chemical libraries and the like. 
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In general, the specificity of SOC/CRAC binding to a specific molecule is determined 
by binding equilibrium constants. Targets which are capable of selectively binding a 
SOC/CRAC polypeptide preferably have binding equilibrium constants of at least about 10^ 
M*', more preferably at least about 10* M**, and most preferably at least about 10^ M*'. The 
wide variety of cell bzised and cell fi-ee assays may be used to demonstrate SOC/CRAC- 
specific binding. Cell based assays include one, two and three hybrid screens, assays in 
which SOC/CRAC-mediated transcription is inhibited or increased, etc. Cell fi-ee assays 
include SOC/CRAC-protein binding assays, immunoassays, etc. Other assays useful for 
screening agents which bind SOC/CRAC polypeptides include fluorescence resonance energy 
transfer (FRET), and electrophoretic mobility shift analysis (EMSA). 

Various techniques may be employed for introducing nucleic acids of the invention 
into cells, depending on whether the nucleic acids are introduced in vitro or in vivo in a host. 
Such techniques include transfection of nucleic acid-CaP04 precipitates, transfection of 
nucleic acids associated with DEAE, transfection with a retrovirus including the nucleic acid 
of interest, liposome mediated transfection, and the like. For certain uses, it is preferred to 
target the nucleic acid to particular cells. In such instances, a vehicle used for delivering a 
nucleic acid of the invention into a cell (e.g., a retrovirus, or other virus; a liposome) can have 
a targeting molecule attached thereto. For example, a molecule such as an antibody specific 
for a surface membrane protein on the target cell or a ligand for a receptor on the target cell 
can be bound to or incorporated within the nucleic acid delivery vehicle. For example, where 
liposomes are employed to deliver the nucleic acids of the invention, proteins which bind to a 
surface membrane protein associated with endocytosis may be incorporated into the liposome 
formulation for targeting and/or to facilitate uptake. Such proteins include capsid proteins or 
fi-agments thereof tropic for a particular cell type, antibodies for proteins which undergo 
internalization in cycling, proteins that target intracellular localization and enhance 
intracellular half life, and the like. Polymeric delivery systems also have been used 
successfully to deliver nucleic acids into cells, as is known by those skilled in the art. Such 
systems even permit oral delivery of nucleic acids. 

Other delivery systems can include time-release, delayed release or sustained release 
delivery systems. Such systems can avoid repeated administrations of the anti-inflammatory 
agent, increasing convenience to the subject and the physician. Many types of release 
delivery systems are available and knovm to those of ordinary skill in the art. They include 
polymer base systems such as poly(lactide-glycolide), copolyoxalates, polycaprolactones. 
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polyesteramides, polyorthoesters, polyhydroxybutyric acid, and polyanhydrides. 
Microcapsules of the foregoing polymers containing drugs are described in, for example, U.S. 
Patent 5,075,109. Delivery systems also include non-polymer systems that are: lipids 
including sterols such as cholesterol, cholesterol esters and fatty acids or neutral fats such as 
mono- di- and tri-glycerides; hydrogel release systems; sylastic systems; peptide based 
systems; wax coatings; compressed tablets using conventional binders and excipients; 
partially fused implants; and the like. Specific examples include, but are not limited to: (a) 
erosional systems in which an agent of the invention is contained in a form within a matrix 
such as those described in U.S. Patent Nos, 4,452,775, 4,675,189, and 5,736,152, and (b) 
diffusional systems in which an active component permeates at a controlled rate from a 
polymer such as described in U.S. Patent Nos. 3,854,480, 5,133,974 and 5,407,686. In 
addition, pimip-based hardware delivery systems can be used, some of which are adapted for 
implantation. 

Use of a long-term sustained release implant may be particularly suitable for treatment 
of chronic conditions. Long-term release, are used herein, means that the implant is 
constructed and arranged to delivery therapeutic levels of the active ingredient for at least 30 
days, and preferably 60 days. Long-term sustained release implants are well-known to those 
of ordinary skill in the art and include some of the release systems described above. 

The invention also contemplates gene therapy. The procedure for performing ex vivo 
gene therapy is outlined in U.S. Patent 5,399,346 and in exhibits submitted in the file history 
of that patent, all of which are publicly available documents. In general, it involves 
introduction in vitro of a functional copy of a gene into a cell(s) of a subject which contains a 
defective copy of the gene, and returning the genetically engineered cell(s) to the subject. The 
functional copy of the gene is under operable control of regulatory elements which permit 
expression of the gene in the genetically engineered cell(s). Numerous transfection and 
transduction techniques as well as appropriate expression vectors are well known to those of 
ordinary skill in the art, some of which are described in PCT application WO95/00654. Jn 
vivo gene therapy using vectors such as adenovirus, retroviruses, herpes virus, and targeted 
liposomes also is contemplated according to the invention. See, e.g., U.S. Patent Nos. 
5,670,488, entitled "Adenovirus Vector for Gene Therapy", issued to Gregory et al., and 
5,672,344, entitled "Viral-Mediated Gene Transfer System", issued to Kelley et al. 
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The invention will be more fully understood by reference to the following examples. 
These examples, however, are merely intended to illustrate the embodiments of the invention 
and are not to be construed to limit the scope of the invention. 

Examples 

As an initial approach to identifying SOC/CRAC channels, we considered publicly 
available data and hypothesized that the following characteristics are likely to be exhibited by 
SOC/CRAC calcium channels: i) SOC/CRAC calcium chaimels would be integral membrane 
proteins related (probably distantly) to one of the known calcium channel families (e.g. 
voltage gated, ligand gated, Trp), and therefore should have a pore region formed by a 
tetramer of 6-7 transmembrane (TM) regions; ii) high calcium selectivity was likely to come 
at the price of complexity, and therefore these were likely to be large proteins; iii) the high 
calcium selectivity of this type of channel was likely to be usefid and, therefore, highly 
conserved; and iv) these channels should be expressed in one or more types of lymphocytes, 
smce ICRAC is best defined in those cell types. Since the full genome of the nematode C. 
elegans is nearing completion, and IP3-dependent calcium signals have recently been shown 
to be required for one or more aspects of C elegans development, we took the set of proteins 
encoded by this genome (at the time this search was initiated WORMPEP14 was the available 
predicted protein set) and began searching for proteins which fit the criteria above. This 
search began by proceeding in alphabetical order through WORMPEP14 and arbitrarily 
excluding all proteins below approximately 1000 amino acids in size, followed by focusing on 
remaining proteins with clear TM spanning regions similar to those of other calcium channels. 
We stopped this screen on encountering a protein designated C05C12.3, a predicted protein 
of 1816 amino acids (SEQ ID NO:13). C05C12.3 was noteable because its central pore 
region had some sequence similarity to but was clearly distinct from members of the Trp 
family of calcium channels, and the hydrophobicity plot of this region showed a 
characteristically wide spacing between the fifth and sixth TM regions for the amino acid 
residues which are thought to line the chaimel pore region and mediate the calcium selectivity 
of the channels. In addition, it lacked any ankyrin repeats in the region amino-tenninal to its 
pore region, further distinguishing it from other Trp family proteins. 

We then used C05C12.3 for BLAST alignment screening of the rest of the C elegans 
genome and also mammalian databases for homologous proteins, revealing two other C 
elegans homologues (SEQ ID NO: 14 and SEQ ID NO: 15), and also a recently cloned 
mammalian protein named melastatin-1 (MLSN-l/SOC-1, SEQ ID NOs:9 and 10, and 
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GenBank Acc, No. AF071787). Using these sequences, we subsequently performed an 

exhaustive screening of publicly accessible EST databases in search of lymphocjrte 

homologues, but were unsuccessful in detecting any homologous transcripts in any 

lymphocyte lines. Since MLSN-1 (SEQ ID NOs:9 and 10) was expressed exclusively in 

melanocytes and retina by Northern blot hybridization and by EST database searching, there 

was no evidence that this type of channel was expressed in the type of cell in which ICRAC- 

like currents were best defined. Subsequent BLAST searches picked up mouse EST 

sequence AI098310 (SEQ ID NO:22) from a monocyte cell line. The I.M.A.G.E. consortium 

clone containing the above-identified EST was then purchased from ATCC (clone ID. 

1312756, Manassas, VA) and was fiirther characterized. Using other portions of this 

sequence in EST searches, we subsequently picked up similar sequences in human B-cells 

(SEQ ID NOs:20 and 21), and other cell types as well (SEQ ID NOs: 11, 12, 16, 17, 18, and 

19). Most of these sequences were subsequently identified to be part of the 3'-UTR or of the 

carboxy terminal region of the proteins, which are not readily identifiable as Trp channels, 

providing an explanation for the art's inability to detect any type of Trp related transcripts in 

lymphocytes. Partial sequences fi-om the 5' and/or 3' ends of the above identified clones were 

then used to screen leukocyte and kidney cDNA libraries to extend the original sequences 

more toward the 3' and/or 3' ends. 

In view of the foregoing, it was concluded that channels of this type were expressed in 

many types of lymphocytes, and therefore were members of a new family of SOC/CRAC 

calcium channels. 

Experimental Procedures 
Screening of the cDNA libraries 

Leukocyte and kidney cDNA libraries firom Life Technologies (Gaithersburg, MD) 

were screened using the Gene Trapper II methodology (Life Technologies) according to 

manufacturer's reconunendation, using the inserts of LM.A.G.E. clone ID nos. 1312756 and 

1076485 fi-om ATCC (Manassas, VA), imder stringent hybridization conditions. Using 

standard methodology ( Molecular Cloning: A Laboratory Manzdal, J. Sambrook, et al., eds.. 

Second Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1989, 

or Current Protocols in Molecular Biology, P.M. Ausubel, et al., eds., John Wiley & Sons, 

Inc., New York), individual cDNA clones were subjected to 3-4 rounds of amplification and 

purification under the same hybridization conditions. 

After excision fi-om the vector and subcloning of inserts into the plasmid forms, 

several clones were sequenced by the Beth Israel Deaconess Medical Center's Automated 
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Sequencing Facility. Molecular biological techniques such as restriction enzyme treatment, 
subcloning, DNA extraction, bacterial culture and purification of DNA fragments were 
performed according to methods well known in the art. Computer analyses of protein and 
DNA sequences was done using "Assemblylign" (Oxford Molecular, Cambell, CA). Multiple 
alignments of the SOC/CRAC family members were produced using the CLUSTAL facility 
of the MacVector program. Restriction endonucleases, expression vectors, and modifying 
enzymes were purchased from commercial sources (Gibco-BRL). Sequencing vectors for 
DNA were purchased from Stratagene (La Jolla, CA). 

Once the first members of what appeared to be a novel family of calcium channel 
receptors were identified and characterized, additional BLAST alignments were performed 
with the newly characterized nucleic acid sequences. An initial match was with genomic 
DNA fragment NH0332L1 1 (Genbank Acc, No. AC005538). Using this genomic sequence, 
promers were designed and a number of cDNA libraries was surveyed by PGR. A prostate 
specific message was identified and characterized, leading to the isolation and 
characterization of SOC-4/CRAC-3 (SEQ ID NOs: 31 and 32). 
Functional Assays 

Transient Expression of SOC/CRAC 

In our initial transient expression experiments, we expressed or expect to express a 
SOC/CRAC molecule transiently in RBL-2H3 mast cells, Jurkat T cells, and A20 
B-lymphocytes using both electroporation and vaccinia virus-driven expression, and 
measured the calcium influx produced by depletion of intracellular calcium stores with 
thapsigargin. Each of the foregoing techniques is well known to those of ordinary skill in the 
art and can be performed using various methods (see, e.g.. Current Methods in Molecular 
Biology, eds. Ausubal, F.M., et al. 1987, Green Publishers and Wiley Interscience, N.Y., 
N.Y.). Exemplary methods are described herein. 

Depletion of intracellular calcium stores is accomplished by treating the cells with 1 
micromolar thapsigargin; alternative agents which function to deplete intracellular stores are 
described in by Putney, J.W. Jr., in Caoacitative Calciimi Entry, R.G. Landes Co. and 
Chapman & Hall, 1997 and include, for example, ionomycin, cyclopiazonic acid, and DBHQ. 

Calcium influx is determined by measuring cytoplasmic calcium as indicated using the 
fura-2 fluorescent calcium indicator (see, e.g., G. Grynkiewicz, M. Poenie, R. Y. Tsien, A 
new generation of Ca2+ indicators with greatly improved fluorescence properties, J. Biol 



wo 00/4061 4 PCT/US99y29996 

-45- 

Chem 260, 3440-50 (1985), and M. Poenie, R. Tsien, Fura-2: a powerful new tool for 
measuring and imaging [Ca2+]i in single cells. Prog Clin Biol Res 210 , 53-6 (1986)). 

Patch Clamp Analysis and Determining Selectivity of SOC/CRAC 

Patch clamp analysis of cells injected with SOC/CRAC cRNA is performed by using 
the general patch technique as described in Neher, E., "Ion channels for communication 
between and within cells", Science, 1992 : 256:498-502. Specific techniques for applying the 
patch clamp analysis to RBL cells are described in Hoth, M., and Penner, R., "Depletion of 
intracellular calcium stores activates a calcium current in mast cells". Nature, 1992 : 
355:3535-355. Additional protocols for applying the patch clamp technique to other cell 
types are described in Putney, J.W. Jr., in Capacitative Calcium Entry. R.G. Landes Co. and 
Chapman & Hall, 1997 

An exemplary protocol for patch clamp analysis of SOC/CRAC molecule expressed in 
RBL-2H3 mast cells using a recombinant vaccinia virus is as follows. The currents elicited 
by store depletion are determined using the whole cell configuration (Neher, £., Science. 
1992 : 256:498-502). Currents in SOC/CRAC expressing cells are compared to currents in 
control cells expressing an irrelevant protein or a classic Trp family cedciimi channel known 
as VRl (M. J. Caterina, et al.. The capsaicin receptor: a heat-activated ion channel in the pain 
pathway [see comments]. Nature 389, 816-24 (1997)) in order to assess the contribution of 
SOC/CRAC expression. In addition, the magnitude of whole cell currents in the presence of 
extracellular calciimi (10 mM), barium (10 mM), or magnesium (10 mM) are compared to 
determine the relative permeability of the channels to each of these ions (Hoth, M., and 
Penner, R., Nature, 1992 : 355:3535-355) and, thereby, determine the ionic selectivity. 

Pharmacologic Behavior of SOC/CRAC 

For analysis of the pharmacologic behavior of a SOC/CRAC molecule, a SOC/CRAC 
molecule is expressed in RBL-2H3 mast cells using a recombinant vaccinia virus, and the 
degree of calcium influx elicited by store depletion is monitored using a bulk 
spectrofluorimeter or a fluorescence microscope and the calcium sensitive dye fura-2 (G. 
Grynkiewicz, M. Poenie, R. Y. Tsien, A new generation of Ca2^ indicators with greatly 
improved fluorescence properties, J Biol Chem 260 , 3440-50 (1985) and M. Poenie, R. 
Tsien, Fura-2: a powerful new tool for measuring and imaging [Ca2+]i in single cells, Prog 
Clin Biol Res 210, 53-6 (1986)). The level of cytoplasmic calcium in SOC/CRAC 
expressing cells is compared to the level achieved in control cells expressing an irrelevant 
protein or a classic Trp. family calcium channels known as VRl (M. J. Caterina, et al.. The 



wo 00/40614 PCTAJS99/29996 

-46- 

capsaicin receptor: a heat-activated ion channel in the pain pathway [see comments]. Nature 
389 . 816-24 (1997)). These cells then are pre-incubated with the desired pharmacologic 
reagent, and again the response to store depletion is monitored. Comparison of the effect of 
depleting stores in SOC/CRAC expressing cells relative to controls in the presence or absence 
of the pharmacologic reagent is used to assess the ability of that reagent to modulate 
SOC/CRAC activity. Sphingosine is an exemplary molecule that can be used as 
pharmacologic reagents for pharmacologic characterization of SOC/CRAC calcium channels. 
See, e.g., Mathes, C, et al.. Calcium release activated calcium current as a direct target for 
sphingosine, J Biol Chem 273(39) :25020-25030 (1998). Other non-specific calcium chaimel 
inhibitors that can be used for this purpose include SKR96365 (Calbiochem) and Lanthanum. 
Bulk Calcium Assays 

Bulk calcium assays can be performed in a PTl Deltascan bulk spectrofluorometer using fiira- 
2 as described in Scharenberg AM, et al., EMBO J, 1 995, 14(14):3385-94. 
Gene Tarstetim 

The method (and reagents) described by Buerstedde JM et al, (Cell, 1991, Oct 4;67(1):179- 
88), was used to generate "knockouts" in cells. Briefly, part of the chicken SOC-2/CRAC-1 
genomic sequence coding for the transmembrane region was cloned utilizing the hiunan 
sequence as the probe in a chicken library screen. Chicken SOC-2/CRAC-1 clones were 
isolated and characterized using standard methodology. The putative exon and domain 
arrangement of the chicken SOC-2/CRAC-1, is depicted in Figure 1. The exons coding for 
TM5 (pore region) and TM6, were replaced with promoter/antibiotic cassettes (see Figurel). 
These targeting vectors were then used to target (and replace) the endogenous gene in DT-40 
cells (chicken B lymphocyte cells). 

Results 

Example 1 : Transient Expression of SOC/CRAC 

In the above-identified cell lines and using both of the foregoing expression 
techniques, SOC/CRAC expression enhances thapsigargin-dependent influx. In addition, 
SOC/CRAC expression also enhances the amount of intracelluleir calcium stores. That this 
effect is likely due to SOC/CRAC acting as a plasma membrane calcium chaimel can be 
confirmed by producing an in-frame carboxy-terminal translational fusion with green 
fluorescent protein followed by confocal microscopy, revealing that SOC/CRAC is expressed 
predominantly as a plasma membrsme calcium channel. 
Example 2 : Patch Clamp Analysis 
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The biophysical characteristics of SOC/CRAC enhanced currents when expressed in 
Xenopiis oocytes are determined. SOC/CRAC cRNA injection is able to enhance 
thapsigargin-dependent whole cell currents. In addition, SOC/CRAC does not alter the 
reversal potential of these currents and the determination of the Pca/Pwa ratio shows that 
SOC/CRAC channels are highly calcium selective. 
Example 3 : Pharmacologic Behavior of SOC/CRAC 

The pharmacologic behavior of SOC/CRAC is evaluated as described above. 
SOC/CRAC-enhanced influx is inhibited by sphingosine in a manner that is substantially the 
same as that of endogenous thapsigargin-dependent calcium influx. 
Example 4 : Gene targeting 

Transfection of DT-40 cells with the foregoing targeting vectors, selection for 
antibiotic resistance, and screening, is coUectivelly refered to, herein, as a round of targeting. 
For the first round of targeting SOC-2/CRAC-1, 18/24 clones with homologous 
recombination of the targeting construct into one of the endogenous SOC-2/CRAC-1 alleles 
were obtained. On the second round of targeting (in order to target the second allele and 
therefore generate a homozygous SOC-2/CRAC-1 mutant cell), 0/48 clones were obtained. 
These results indicate that a "null" SOC-2/CRAC-1 mutation is detrimental to DT-40 cells, 
and that SOC-2/CRAC-1 is required for cell viability. 

Table I. Nucleotide Sequences with homologies to SOC/CRAC nucleic acids 
Sequences with SEQ ID NOs and GenBank accession numbers: 
SEQ ID NO:9, AB001535, AI226731, H18835, AA419592, AA261842, AA419407, 
AA592910, D86107, AI098310, AF071787, Z77132, Z831 17, Z68333, AA708532, 
AA551759, AA932133, R47363, N31660, AC005538, AA654650, AA3701 10, AA313170, 
AA493512, AI670079, AI671853. 

Table 11. Amino Acid Sequences with homologies to SOC/CRAC polypeptides 
Sequences with SEQ ID NOs and GenBank accession numbers: 

SEQ ID NO: 10, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID 
NO:19, AB001535, AA592910, D86107, AF071787, Z77132, Z83117, Z68333, 
AA708532, AA551759, AA932133, R47363, N31660, NP003298, CAB00861,NP002411, 
CAA92726, CAB05572. 

All references, patents, and patent docxmients disclosed herein are incorporated by 
reference herein in their entirety. 

What is claimed is presented below and is followed by a Sequence Listing. We claim: 
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Claims 

1 . An isolated nucleic acid molecule, comprising: 

(a) nucleic acid molecules which hybridize under stringent conditions to a nucleic 
acid molecule selected from the group consisting of SEQ ID NO:l, SEQ ID NO:3, SEQ ID 
NO:5, SEQ ID NO:7, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, and 
SEQ ID NO:3 1 , and which code for a SOC/CRAC polypeptide; 

(b) deletions, additions and substitutions of (a) which code for a respective 
SOC/CRAC polypeptide; 

(c) nucleic acid molecules that differ from the nucleic acid molecules of (a) or (b) in 
codon sequence due to the degeneracy of the genetic code, and 

(d) complements of (a), (b) or (c). 

2. The isolated nucleic acid molecule of claim 1, wherein the isolated nucleic acid 
molecule comprises SEQ ID NO:l . 

3. The isolated nucleic acid molecule of claim 1, wherein the isolated nucleic acid 
molecule comprises SEQ ID NO:27. 

4. The isolated nucleic acid molecule of claim 1, wherein the isolated nucleic acid 
molecule comprises SEQ ID NO:29. 

5. The isolated nucleic acid molecule of claim 1, wherein the isolated nucleic acid 
molecule comprises SEQ ID NO:3 1 . 

6. An isolated nucleic acid molecule selected from the group consisting of 

(a) a unique fragment of a nucleic acid molecule selected from the group 
consisung of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:23, 
SEQ ID NO:25, SEQ ID NO:29, and SEQ ID NO:3 1 , 

(b) complements of (a), 

provided that the unique fragment includes a sequence of contiguous nucleotides which is not 
identical to any sequence selected from a sequence group consisting of 

(1) sequences having the SEQ. ID NOS. or GenBank accession numbers of Table I, 

(2) complements of (1), and 

(3) fragments of (1 ) and (2). 
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7. The isolated nucleic acid molecule of claim 6, wherein the sequence of contiguous 
nucleotides is selected from the group consisting of: 

(1) at least two contiguous nucleotides nonidentical to the sequence group, 

(2) at least three contiguous nucleotides nonidentical to the sequence group, 

(3) at least four contiguous nucleotides nonidentical to the sequence group, 

(4) at least five contiguous nucleotides nonidentical to the sequence group, 

(5) at least six contiguous nucleotides nonidentical to the sequence group, 

(6) at least seven contiguous nucleotides nonidentical to the sequence group. 

8. The isolated nucleic acid molecule of claim 6, wherein the unique fragment has a size 
selected from the group consisting of at least: 8 nucleotides, 10 nucleotides, 12 nucleotides, 
14 nucleotides, 16 nucleotides, 18 nucleotides, 20, nucleotides, 22 nucleotides, 24 
nucleotides, 26 nucleotides, 28 nucleotides, 30 nucleotides, 50 nucleotides, 75 nucleotides, 
100 nucleotides, and 200 nucleotides. 

9. The isolated nucleic acid molecule of claim 6, wherein the molecule encodes a 
polypeptide which is immimogenic. 

10. An expression vector comprising the isolated nucleic acid molecule of claims 1, 2, 3, 
4, 5, 6, 7, 8, or 9 operably linked to a promoter. 

11. A host cell transformed or transfected with the expression vector of claim 1 0. 

12. An isolated polypeptide encoded by the isolated nucleic acid molecule according to 
anyone of claims 1 or 6, wherein the polypeptide comprises a SOC/CRAC polypeptide or a 
unique fragment thereof. 

13. The isolated polypeptide of claim 12, wherein the isolated polypeptide is encoded by 
the isolated nucleic acid molecule of claim 2, 3, 4, or 5. 

14. The isolated polypeptide of claim 13, wherein the isolated polypeptide comprises a 
polypeptide having the sequence of amino acids selected from the group consisting of SEQ ID 
NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:24, SEQ ID NO:26, SEQ ID 
NO:28, SEQ ID NO:30, and SEQ ID NO:32. 
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1 5. An isolated polypeptide encoded by the isolated nucleic acid molecule of claim 1, 2, 3, 
4, or 5, vs4ierein the polypeptide, or unique fragment thereof is immunogenic. 

16. An isolated binding polypeptide which binds selectively to a polypeptide encoded by 
the isolated nucleic acid molecule of claim 1, 2, 3, 4, or 5. 

17. The isolated binding polypeptide of claim 16, wherein the isolated binding 
polypeptide binds to a polypeptide having the sequence of amino acids selected from the 
group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID 
NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, and SEQ ID NO:32. 

18. The isolated binding polypeptide of claim 17, wherein the isolated binding 
polypeptide is an antibody or an antibody fragment selected from the group consisting of a 
Fab fragment, a F(ab)2 fragment or a fragment including a CDR3 region selective for the 
polypeptide. 

19. An isolated polypeptide, comprising a unique fragment of the polypeptide of claim 12 
of sufficient length to represent a sequence unique within the human genome, provided that 
the fragment excludes a sequence of contiguous amino acids identified in Table II. 

20. A method for isolatmg a SOC/CRAC molecule having SOC/CRAC calcium channel 
activity, comprising: 

a) contacting a binding molecule that is a SOC/CRAC nucleic acid or a SOC/CRAC 
binding polypeptide with a sample containing one or more SOC/CRAC molecules, under 
conditions sufficient to form a complex of the SOC/CRAC nucleic acid or the SOC/CRAC 
binding polypeptide and the SOC/CRAC molecule; 

b) detecting the presence of the complex; 

c) isolating the SOC/CRAC molecule from the complex; and 

d) determining whether the isolated SOC/CRAC molecule has SOC/CRAC calcium 
channel activity. 

21 . The method of claim 20, wherein the binding molecule is a SOC/CRAC nucleic acid. 

22. The method of claim 20, wherein the binding molecule is a SOC/CRAC binding 
polypeptide. 
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23. The method of claim 21, wherein the SOC/CRAC nucleic acid comprises at least 14 
nucleotides from any contiguous portion of a sequence of nucleotides selected from the group 
consisting of SEQ ID NO:!, SEQ ID NO:3. SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:23, 
SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, and SEQ ID NO:3 1 . 

5 24. A method for identifying agents useful in the modulation of SOC/CRAC calcium 
chaimel activity, comprising: 

a) contacting a SOC/CRAC polypeptide with a candidate agent suspected of 
modulating SOC/CRAC calcium channel activity, tmder conditions sufficient to allow the 
SOC/CRAC polypeptide to interact selectively with the candidate agent; 
w b) detecting a Ca^"*" concentration associated with SOC/CRAC calcium channel 

activity of the SOC/CRAC polypeptide in the presence of the candidate agent; and 

c) comparing the Ca^"^ concentration of step (b) with a control Ca^"*" concentration of a 
SOC/CRAC polypeptide in the absence of the candidate agent to determine whether the 
candidate agent modulates SOC/CRAC calcium channel activity. 

15 

25. A method for determining the level of SOC/CRAC expression in a subject, 
comprising: 

a) measuring the expression of SOC/CRAC in a test sample obtained from the 
subject, and 

20 b) comparing the measured expression of SOC/CRAC in the test sample to the 

expression of the SOC/CRAC polypeptide in a control to determine the level of SOC/CRAC 
expression in the subject. 

26. The method of claim 25, wherein the expression of SOC/CRAC in (b) is SOC/CRAC 
25 mRNA expression. 

27. The method of claim 25, wherein the expression of SOC/CRAC in (b) is SOC/CRAC 
polypeptide expression. 

28. The method of claim 25, wherein the test sample is tissue. 



29. 



The method of claim 25, wherein the test sample is a biological fluid. 
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30. The method of claim 26, wherein SOC/CRAC mRNA expression is measured using 
the Polymerase Chain Reaction (PGR). 

31. The method of claim 26, wherein SOC/CRAC mRNA expression is measured using a 
method selected from the group consisting of northern blotting, monoclonal antisera to 
SOC/CRAC and polyclonal antisera to SOC/CRAC. 

32. A kit, comprising a package containing: 

an agent that selectively binds to the isolated nucleic acid of clmm 1 or an 
expression product thereof, and 

a control for comparing to a measured value of binding of said agent to said 
isolated nucleic acid of claim 1 or expression product thereof. 

33. The kit of claim 32, wherein the control comprises an epitope of the expression 
product of the nucleic acid of claim 1. 

34. A pharmaceutical composition comprising; 

a pharmaceutically effective amoimt of an agent comprising of an isolated 
nucleic acid molecule of claim 1 or an expression product thereof, and 
a pharmaceutically acceptable carrier. 

35. The pharmaceutical composition of claim 34, wherein the agent is an expression 
product of the isolated nucleic acid molecule of claim 1. 

36. A method for identifying agents useful in the modulation of a SOC/CRAC polypeptide 
kinase activity, comprising: 

a) contacting a SOC/CRAC polypeptide wdth kinase activity with a candidate agent 
suspected of modulating SOC/CRAC kinase activity, imder conditions sufficient to allow the 
candidate agent to interact with the SOC/CRAC polypeptide and modulate its kinase activity; 

b) detecting a kinase activity associated with the SOC/CRAC polypeptide in the 
presence of the candidate agent; and 

c) comparing the kinase activity of step (b) with a control kinase activity of a 
SOC/CRAC polypeptide in the absence of the candidate agent to determine whether the 
candidate agent modulates SOC/CRAC kinase activity. 

37. The method of claim 36, wherein the SOC/CRAC polypeptide comprises 
amino acids 999-1 180 of the sequence represented as SEQ ID NO:24, or a fragment thereof 
that retains the kinase activity. 
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SEQUENCE LISTING 



<110> Beth Israel Deaconess Medical Center, Inc. 
Scharenberg, Andrew 

<120> CHARACTERIZATION OF A CALCIUM CHANNEL FAMILY 



<130> B0662/7026WO/ERP/KA 

<150> U.S. 60/114,220 
<151> 1998-12-30 

<150> U.S. 60/120,018 
<151> 1999-01-29 



<150> U.S. 60/140,415 
<151> 1999-06-22 



<160> 32 

<170> FastSEQ for Windows Version 3.0 



<210> 1 

<211> 1212 

<212> DNA 

<213> Homo Sapiens 



<400> 1 

gcacgaggca aattttttgt tagtacacca tctcagccaa gttgcaaaag ccacttggaa 60 

actggaacca aagatcaaga aactgtttgc tctaaagcta cagaaggaga taatacagaa 120 

tttggagcat ttgtaggaca cagagatagc atggatttac agaggtttaa agaaacatca 180 

aacaagataa aaatactatc caataacaat acttctgaaa acactttgaa acgagtgagt 24 0 

tctcttgctg gatttactga ctgtcacaga acttccattc ctgttcattc aaaacaagaa 300 

aaaatcagta gaaggccatc taccgaagac actcatgaag tagattccaa agcagcttta 360 

ataccggttt gtagatttca actaaacaga tatatattat taaatacatt aaactttttt 420 

agataagatc tacaaagtgg tgatatttgg gactatatca aaaattcaaa aaaatttttc 4 80 

ttaagaaaac tgactttagc atagtagcag ttacagaaaa gtttcttaca gtgaatagtc 540 

aggaatttta aagaaaaatt tatgcagaat aaaggcagga atctcttttt gtttgaattg 600 

aagctaatta tatgaactca tttccagcta actgcgataa tgattgattt tgcaaattcc 660 

ctttaaaagc acacactgac aagacaaaaa gctcaggaaa aggcagaaaa attactcctt 720 

tataatcaag tattatatat aagtcagtgc tcataatttt gctcaagaaa atattgactt 780 

acattcatat atatctgttc tggcatagag agattatgtt gttaaaatca tgttattgaa 840 

aaaagttatt tcagtgggga aagaggttag ttaacaaaga gattcacagt aacaaatcct 900 

cctttctgga gggactcttc ctgaccctga gctgcacaac tttgcaacaa attaaagcct 960 

aaccgaagat gacctcacaa tggcaattta gaactcatgg gagtcaactt acataaacgg 1020 

tatttgattt ctgataagat agtggaatta ttggttatag atgacaaaat aagtatgttt 1080 

aaagtgatga tggacataaa aaagttttaa atataaaaca tgagaaaaga aggagatact 114 0 

attcaaaaag actggcaaat ttgaaaaact agaaataaaa aaaaaaaaaa aaaatgagcg 1200 

gccgcaagct tt 1212 



<210> 2 

<211> 141 

<212> PRT 

<213> Homo Sapiens 



<400> 2 

Ala Arg Gly Lys Phe Phe Val Ser 

1 5 
Ser His Leu Glu Thr Gly Thr Lys 
20 



Thr Pro Ser Gin Pro Ser Cys Lys 

10 15 
Asp Gin Glu Thr Val Cys Ser Lys 
25 30 
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Ala 


Thr 


Glu 


Gly 


Asp 


Asn 


Thr 


Glu 


Phe 


Gly Ala 


Phe 


Val 


G±y 


His 


Arg 
















4 0 










4 5 








Asp 


Ser 
50 


Met 


Asp 


Leu 


Gin 


Arg 
55 


Phe 


Lys 


Glu 


Thr 


Ser 
60 


Asn 


Lys 


He 


Lys 


He 


Leu 


Ser 


Asn 


Asn 


Asn 


Thr 


Ser 


Glu 


Asn 


Thr 


Leu 


Lys 


Arg 


Val 


Ser 


65 










70 










75 










80 


Ser 


Leu 


Ala 


Gly 


Phe 
85 


Thr 


Asp 


Cys 


His 


Arg 
90 


Thr 


Ser 


He 


Pro 


Val 
95 


His 


Ser 


Lys 


Gin 


Glu 
100 


Lys 


He 


Ser 


Arg 


Arg 
105 


Pro 


Ser 


Thr 


Glu 


Asp 
110 


Thr 


His 


Glu 


Val 


Asp 
115 


Ser 


Lys 


Ala 


Ala 


Leu 
120 


He 


Pro 


Val 


Cys 


Arg 
125 


Phe 


Gin 


Leu 


Asn 


Arg 


Tyr 


He 


Leu 


Leu 


Asn 


Thr 


Leu 


Asn 


Phe 


Phe 


Arg 









130 135 140 



<210> 3 

<211> 739 

<212> DNA 

<213> Homo Sapiens 

<220> 

<221> unsure 
<222> (5) . . . (5) 
<223> UNKNOWN 

<221> unsure 
<222> (21) . . . (22) 
<223> UNKNOWN 

<221> unsure 
<222> (29) . . . (29) 
<223> UNBCNOWN 



<400> 3 

tcgantaggg gtcttccacc nncatactng gatgatggtt ggtgaagtct atgcatacga 60 

aattgatgtg tgtgcaaacg attctgttat ccctcaaatc tgtggtcctg ggacgtggtt 120 

gactccattt cttcaagcag tctacctctt tgwacagtat atcattatgg ttaatcttct 180 

tattgcattt ytcaacaatg tgtatttaca agtgaaggca atttccaata ttgyatggaa 240 

gtaccagcgt tatcatttta ttatggctta tcatgagaaa ccagttctgc ctcctccact 300 

tatcattctt agccatatag tttctctgtt ttgctgcata tgtaagagaa gaaagaaaga 360 

taagacttcG gatggaccaa aacttttctt aacagaagaa gatcaaaaga aacttcatga 420 

ttttgaagag cagtgtgttg aaatgtattt caatgaaaaa gatgacaaat ttcattctgg 480 

gagtgaagag agaattcgtg tcacttttga aagagtggaa cagatgtgca ttcagattaa 540 

agaagttgga gatccgtgtc aactacataa aaagatcatt acaatcatta gattctcaaa 600 

ttggccattt gcaagatctt tcagccctga cggtagatac attaaaaaca ctcactggcc 660 

aaaagcgtcg gaagctagca aagttcataa tgaaatcaca cgagaactga gcatttccaa 720 

acacttggct caaaacctt 739 



<210> 4 
<2H> 235 
<212> PRT 

<213> Homo Sapiens 
<220> 

<221> UNSURE 
<222> (41) . . . (41) 
<223> UNKNOWN 



<221> UNSURE 
<222> (54)... (54) 
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<221> UNSURE 
<222> (68) . . . (68) 
<223> UNKNOWN 



<400> 4 



Met 


Met 


Val 


Gly 


Glu 


vax 


Tyr 


H.X a 


Tyr 


nit'. 


He 


Asp 


Val 


Cys 


Ala 


Asn 


1 






5 










1 u 














Asp 


Ser 


Val 


lie 


Pro 


Gin 


He 


Cys 




Pro 


<D± y 


1 riir 


Trp 


Leu 




Pro 






20 










25 
















Phe 


Leu 


Gin 
35 


Ala 


Val 


Tyr 


Leu 


Phe 
4 0 


Xaa 


taXn 


Tyr 


J. xe 


X J. e 




V cl X 


Asn 


Leu 


Leu 
50 


lie 


Ala 


Phe 


Xaa 


Asn 
55 


Asn 


Val 


Tyr 


Leu 


oxn 

ct\ 
bO 


vax 


Lys 


M.X a 


X X 6 


Ser 


Asn 


He 


Xaa 


Trp 


Lys 


Tyr 


Gin 


Arg 


Tyr 


nlS 


fne 


xxe 


we u 


M.X a 


Tyr 


65 








70 










1 D 










fin 


His 


Glu 


Lys 


Pro 


Val 


Leu 


Pro 


Pro 


Pro 


Leu 


T 1 <a 

j-xe 


X xe 


Leu 




n X o 


X J-tr 








o c 
oo 










90 










95 




Val 


Ser 


Leu 


Phe 
100 


Cys 


Cys 


He 


Cys 


Lys 
105 


Arg 


Arg 


Lys 


Lys 


Asp 
110 


Lys 


Thr 


Ser 


Asp 


Gly 


Pro 


Lys 


Leu 


Phe 


Leu 


Thr 


Glu 


Glu 


Asp 


Gin 


Lys 


Lys 


Leu 




115 










120 










125 








His 


Asp 
130 


Phe 


Glu 


Glu 


Gin 


Cys 
135 


Val 


Glu 


Met 


Tyr 


Phe 
140 


Asn 


Glu 


Lys 


Asp 


Asp 


Lys 


Phe 


His 


Ser 


Gly 


Ser 


Glu 


Glu 


Arg 


He 


Arg 


Val 


Thr 


Phe 


Glu 


145 








150 










155 










160 


Arg 


Val 


Glu 


Gin 


Met 


Cys 


He 


Gin 


He 


Lys 


Glu 


Val 


Gly 


Asp 


Pro 


Cys 








165 










170 










175 




Gin 


Leu 


His 


Lys 


Lys 


He 


He 


Thr 


He 


He 


Arg 


Phe 


Ser 


Asn 


Trp 


Pro 








180 








185 










190 






Phe 


Ala 


Arg 


Ser 


Phe 


Ser 


Pro 


Asp 


Gly Arg 


Tyr 


He 


Lys 


Asn 


Thr 


His 






195 










200 










205 








Trp 


Pro 


Lys 


Ala 


Ser 


Glu 


Ala 


Ser 


Lys 


Val 


His 


Asn 


Glu 


He 


Thr 


Arg 


210 








215 










220 










Glu 


Leu 


Ser 


He 


Ser 


Lys 


His 


Leu 


Ala 


Gin 


Asn 












225 










230 










235 













<210> 5 

<211> 1579 

<212> DNA 

<213> Homo Sapiens 

<220> 

<221> unsure 

<222> (368) . . . (368) 

<223> g or c 

<221> unsure 

<222> (372) . • . (372) 

<223> g or c 

<221> unsure 

<222> (374) . . . (374) 

<223> g or a 

<221> unsure 

<222> (375) . . . (375) 

<223> g or c 
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<221> unsure 

<222> (387) . . . (387) 



<221> unsure 

<222> (482) . . . (482) 



<400> 5 

acgtcgcctg caggtaccgg tccggaattc ccgggtcgac ccacgcgtcc ggcatggtgt 60 

tgtaaataca cttagctcct ctcttcctca aggtgatctt gaaagtaata atccrtttca 120 

ttgtaatatt ttaatgaaag atgacaaaga tccccagtgt aatatatttg gtcaagactt 180 

acctgcagta ccccagagaa aagaatttaa ttttccagag gctggttcct cttcrggtgc 24 0 

cttattccca agtgctgttt cccctccaga actgcgacag agactacatg gggtagaact 300 

cttaaaaata tttaataaaa atcaaaaatt aggcagttca tctactagca taccacatct 360 

gtcatccsca csarscaaat tttttgntag tacaccatct cagccaagtt gcaaaagcca 420 

cttggaaact ggaaccaaag atcaagaaac tgtttgctct aaagctacag aaggagataa 480 

tncagaattt ggagcatttg taggacacag agatagcatg gatttacaga ggtttaaaga 54 0 

aacatcaaac aagataaaaa tactatccaa taacaatact tctgaaaaca ctttgaaacg 600 

agtgagttct cttgctggat ttactgactg tcacagaact tccattcctg ttcattcaaa 660 

acaagaaaaa atcagtagaa ggccatctac cgaagacact catgaagtag attccaaagc 720 

agctttaata ccggtttgta gatttcaact aaacagatat atattattaa atacattaaa 7 80 

cttttttaga taagatctac aaagtggtga tatttgggac tatatcaaaa attcaaaaaa 84 0 

atttttctta agaaaactga ctttagcata gtagcagtta cagaaaagtt tcttacagtg 900 

aatagtcagg aattttaaag aaaaatttat gcagaataaa ggcaggaatc tctttttgtt 960 

tgaattgaag ctaattatat gaactcattt ccagctaact gcgataatga ttgattttgc 1020 

aaattccctt taaaagcaca cactgacaag acaaaaagct caggaaaagg cagaaaaatt 1080 

actcctttat aatcaagtat tatatataag tcagtgctca taattttgct caagaaaata 114 0 

ttgacttaca ttcatatata tctgttctgg catagagaga ttatgttgtt aaaatcatgt 1200 

tattgaaaaa agttatttca gtggggaaag aggttagtta acaaagagat tcacagtaac 1260 

aaatcctcct ttctggaggg actcttcctg accctgagct gcacaacttt gcaacaaatt 1320 

aaagcctaac cgaagatgac ctcacaatgg caatttagaa ctcatgggag tcaacttaca 1380 

taaacggtat ttgatttctg ataagatagt ggaattattg gttatagatg acaaaataag 1440 

tatgtttaaa gtgatgatgg acataaaaaa gttttaaata taaaacatga gaaaagaagg 1500 

agatactatt caaaaagact ggcaaatttg aaaaactaga aataaaaaaa aaaaaaaaaa 1560 

atgagcggcc gcaagcttt 1579 



<210> 6 

<211> 243 

<212> PRT 

<213> Homo Sapiens 



<220> 

<221> UNSURE 

<222> (103) . . . (105) 

<223> UNBCNOWN 



<221> UNSURE 

<222> (109) . . . (109) 

<223> UNKNOWN 



<221> UNSURE 

<222> (141) . . - (141) 

<223> UNKNOWN 



<400> 6 

Val Asn Thr Leu Ser Ser Ser Leu 

1 5 
Asn Pro Phe His Cys Asn lie Leu 
20 

Cys Asn lie Phe Gly Gin Asp Leu 
35 40 



Pro Gin Gly Asp Leu Glu Ser Asn 

10 15 
Met Lys Asp Asp Lys Asp Pro Gin 
25 30 
Pro Ala Val Pro Gin Arg Lys Glu 
45 
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f ne 


Asn 


fne 


Pro 


uXU 


A J. a 


Gly 


Ser 


oer 




Gly Ala 


Leu 


Phe 


Pro 
















55 










bU 










A±a 


vax 


Ser 


Pro 


Pro 


pi 11 

uxU 


Leu Arg 


uJin 


Arg 


Leu 


nxs 


Gly Val 


uXU 


Leu 


Db 




















/ b 










Qn 


Leu 


Lys 


lie 


Phe 


Asn 


Lys 


Asn 


Gin 


Lys 


Leu 


uxy 


Ser 


Ser 


Ser 


i nr 


C A V 

oer 










85 










on 














He 


Pro 


His 


Leu 


Ser 


Ser 


Xaa 


Xaa 


Xaa 


Lys 


Phe 


Phe 


Xaa 


Ser 


Thr 


Pro 








100 










xu b 










110 






Ser 


Gin 


Pro 


Ser 


Cys 


Lys 


oer 


nxs 


Leu 


Glu 


Thr 


Gly 


Thr 


Lys 


Asp 


Gin 






TIC 

lib 










120 










125 








Glu 


Thr 


Val 


Cys 


Ser 


Lys 


Ala 


Thr 


Glu 


Gly 


Asp 


Asn 


Xaa 


Glu 


Pne 


Gly 














135 










140 










Ala 


Phe 


Val 


Gly His 


Arg 


Asp 


Ser 


Met 


Asp 


Leu 


Gin 


Arg 


Phe 


Lys 


Glu 


145 










150 










155 










160 


Thr 


Ser 


Asn 


Lys 


He 


Lys 


He 


Leu 


Ser 


Asn 


Asn 


Asn 


Thr 


Ser 


Glu 


Asn 










165 










170 










175 




Thr 


Leu 


Lys 


Arg 


Val 


Ser 


Ser 


Leu 


Ala 


Gly 


Phe 


Thr 


Asp 


Cys 


His 


Arg 








180 










185 










190 






Thr 


Ser 


He 


Pro 


Val 


His 


Ser 


Lys 


Gin 


Glu 


Lys 


He 


Ser 


Arg 


Arg 


Pro 






195 










200 










205 








Ser 


Thr 


Glu 


Asp 


Thr 


His 


Glu 


Val 


Asp 


Ser 


Lys 


Ala 


Ala 


Leu 


He 


Pro 




210 










215 










220 










Val 


Cys 


Arg 


Phe 


Gin 


Leu 


Asn 


Arg 


Tyr 


He 


Leu 


Leu 


Asn 


Thr 


Leu 


Asn 



225 230 235 240 

Phe Phe Arg 



<210> 7 

<211> 3532 

<212> DNA 

<213> Mus Musculus 

<220> 

<221> unsure 

<222> (2420) . . . (2420) 

<223> unknown 

<221> unsure 

<222> (2434) . . . (2434) 

<223> unknown 

<221> unsure 

<222> (2461) . . . (2461) 

<223> un)cnown 

<221> unsure 

<222> (2466) . . . (2466) 

<223> unknown 

<221> unsure 

<222> (2470) . . . (2470) 

<223> unknown 

<400> 7 

attatggctt atcatgaaaa accagtcctg cctcctcctc ttatcatcct cagccatata 60 

gtttcactgt tttgctgtgt atgcaaaaga agaaagaaag ataagacttc cgatgggcca 120 

aaacttttct taacagaaga agatcaaaag aaactccatg attttgaaga gcagtgtgtt 180 

gagatgtact ttgatgagaa agatgacaaa ttcaattctg ggagtgaaga gagaatccgg 24 0 

gtcacttttg aaagagtgga gcagatgagc attcagatta aagaagttgg agatcgtgtc 300 

aactacataa aaagatcatt acagtcttta gattctcaaa ttggtcatct gcaagatctc 360 
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tcagccctaa cagtagatac attgaaaaca cttacagccc agaaagcttc agaagctagt 420 

aaagtgcaca atgagatcac acgagaatrg agtatttcca aacacttggc tcagaatctt 480 

attgatgatg ttcctgtaag acctttgtgg gaagaaccta gtgctgtaaa cacactgagt 540 

tcctctcttc ctcaaggtga tcgggaaagt aataatcctt ttctttgtaa tatttttatg 600 

aaagatgaaa aagaccccca atataatctg tttggacaag atttgcccgt gataccccag 660 

agaaaagaat tcaacattcc agaggctggt tcctcctgtg gtgccttatt cccaagtgct 720 

gtttctcccc cagaattacg acagagacga catggggtag aaatgttaaa aatatttaat 780 

aaaaatcaaa aattaggcag ttcacctaat agttcaccac atatgtcctc cccaccaacc 840 

aaattttctg tgagtacccc atcccagcca agttgcaaaa gtcacttgga atccacaacc 900 

aaagatcaag aacccatttt ctataaagct gcagaagggg ataacataga atttggagca 960 

tttgtgggac acagagatag tatggactta cagaggttta aagaaacatc aaacaaaata 1020 

agagaactgt tatctaatga tactcctgaa aacactctga aacatgtggg tgctgctgga 1080 

tatagtgaat gttgtaagac ttctacttct cttcactcgg tgcaagcaga aagctgtagt 1140 

agaagagcgt cgacggaaga ctctccagaa gtcgattcta aagcagcttt gttaccggat 1200 

tggttacgag atagaccatc aaacagagaa atgccatctg aaggaggaac attaaatggt 1260 

cttgcttctc catttaagcc cgttttggat acaaattact attattcagc tgtggaaaga 1320 

aataacctga tgaggttgtc acagagtatt cccttcgttc ctgtacctcc acgaggcgag 1380 

cctgtcacag tgtaccgtct ggaggagagt tctcccagta tactgaataa cagcatgtct 1440 

tcatggtctc agctaggcct ctgtgccaaa attgagtttt taagtaaaga ggaaatggaa 1500 

ggtggtttac gaagagcagt caaagtgctg tgtacctggt cagagcacga tatcctgaag 1560 

tcagggcatc tctatatcat taagtcattt cttcctgagg tgataaacac atggtcaagc 1620 

atttataaag aagatacggt tctacatctc tgtctcagag aaatacaaca acagagagca 168 0 

gcacaaaagc tcacatttgc ctttaatcag atgaaaccca aatccatacc atattctcca 17 4 0 

aggttccttg aagttttcct gttgtactgc cattcagcag ggcagtggtt tgctgtagaa 1800 

gagtgcatga ctggtgaatt tagaaaatac aacaacaata atggtgatga aatcattcct 1860 

acaaatactc tagaagagat catgctagcc tttagccact ggacctatga atataccaga 1920 

ggggagttac tggtacttga cttacaagga gtgggagaaa acttgactga cccatctgta 1980 

ataaaagctg aagaaaaaag atcctgtgac atggtttttg gccctgccaa tctaggagaa 2040 

gatgcaataa aaaacttcaa gagccaaaca tccactgtaa ttcttgctgt cgaaagctta 2100 

aacttcccag atttgaagag gaatgactac acgcccttga taaaattata tttcctcagg 2160 

atgagtcatc agatttgaat cttcaatctg gaaattccac caaagaatca gaagcaacaa 2220 

attctgttcg tctgatgtta tagtgctgag tcattggttt ttgcctacac ttcacaaaag 2280 

tgtaactgtc agttttcctt tcgggggaat tgatgatata ggaagatgtg tgcaaaatga 2340 

gcttgctggc cccacacata gtctagaggt aatgttctca ttgaaaaacg cctggaggtg 2400 

gaggctgcag atgccagtgn aaagtgctag ctgncagaga gtcagtgctc tcgggctggt 2460 

naaggncggn acccttgctg ctgagagtgg tggttctctt cacctggtgc aggaccatta 2520 

accaaagtca agtcttcaga tttgattggc tgctcagtca cagcccattc agctaaggaa 2580 

actaaattgc gcagcttttt aaatggctga agtcttcctc agtttgtgct ctatgataat 2640 

gatgttagct ctcaactagg tgtttgtggc cacgggagaa ctactcctta caattttgct 2700 

tcacaggcat gttacaaagc ctgcactgaa aaccgtttgt cttccctctc tccctccctc 2760 

ttttccctgt agtattgagg atcaaaccca gggcctcatg aagaccattt tctaagagac 2820 

attttattta agaatcaact atagagtcta tgtttatgga tacagccagt ttttgttaaa 2880 

caaaacctga attgtgcaaa agggtttttt aacatttatc aatgttaagt aaaagaaagc 2940 

catgataaat aagaattaac tcactgttca atgggtgttt cctgtgagga aggttacagt 3000 

tgtaacagcc tgcagttgca tacatctcca aagatttaca gacttagtgt atcaaatcag 3060 

agtgtcatgt gagctctcac attgaaaatt ctataggaat gtgtcaatgt gaattctatt 3120 

tctggnactt aagaaatcag ttgttggatt atccttatac agtataggga gatcacaata 3180 

caactttatg ccaataaaat ctaacttaat tgcccagata tttttgcata tttagcaaca 3240 

agaaaagctt atcatttgac tcaagtttta tgctttctct ttcttttcat ttcctaggta 3300 

ctaattttaa tttttatttg gaaggagcag tgtaaagctt acttgtattc aatagtgtat 3360 

ctcatagata cagacaaggc cgcagagata agctgttaaa tagtgtttaa tgttgatgtg 3420 

gagagaaagg tgtattactt aaaaatacta taccatatac gttttgtata tcattaaatc 3480 

tttaaaagaa attaaattta ttcttgttta aaaaaaaaaa aaaaaaaaaa aa 3532 
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Ile Met Ala Tyr His Glu Lys Pro Vai Leu Pro Pro Pro Leu lie lie 

15 10 15 

Leu Ser His He Val Ser Leu Phe Cys Cys Val Cys Lys Arg Arg Lys 

20 25 30 

Lys Asp Lys Thr Ser Asp Gly Pro Lys Leu Phe Leu Thr Glu Glu Asp 

35 40 45 

Gin Lys Lys Leu His Asp Phe Glu Glu Gin Cys Val Glu Met Tyr Phe 

50 55 60 

Asp Glu Lys Asp Asp Lys Phe Asn Ser Gly Ser Glu Glu Arg He Arg 
65 70 75 80 

Val Thr Phe Glu Arg Val Glu Gin Met Ser He Gin He Lys Glu Val 

85 90 95 

Gly Asp Arg Val Asn Tyr He Lys Arg Ser Leu Gin Ser Leu Asp Ser 

100 105 110 

Gin He Gly His Leu Gin Asp Leu Ser Ala Leu Thr Val Asp Thr Leu 

115 120 125 

Lys Thr Leu Thr Ala Gin Lys Ala Ser Glu Ala Ser Lys Val His Asn 

130 135 140 

Glu He Thr Arg Glu Leu Ser He Ser Lys His Leu Ala Gin Asn Leu 
145 150 155 160 

He Asp Asp Val Pro Val Arg Pro Leu Trp Glu Glu Pro Ser Ala Val 

165 170 175 

Asn Thr Leu Ser Ser Ser Leu Pro Gin Gly Asp Arg Glu Ser Asn Asn 

180 185 190 

Pro Phe Leu Cys Asn He Phe Met Lys Asp Glu Lys Asp Pro Gin Tyr 

195 200 205 

Asn Leu Phe Gly Gin Asp Leu Pro Val He Pro Gin Arg Lys Glu Phe 

210 215 * 220 

Asn He Pro Glu Ala Gly Ser Ser Cys Gly Ala Leu Phe Pro Ser Ala 
225 230 235 240 

Val Ser Pro Pro Glu Leu Arg Gin Arg Arg His Gly Val Glu Met Leu 

245 250 255 

Lys He Phe Asn Lys Asn Gin Lys Leu Gly Ser Ser Pro Asn Ser Ser 

260 265 270 

Pro His Met Ser Ser Pro Pro Thr Lys Phe Ser Val Ser Thr Pro Ser 

275 280 285 

Gin Pro Ser Cys Lys Ser His Leu Glu Ser Thr Thr Lys Asp Gin Glu 

290 295 300 

Pro He Phe Tyr Lys Ala Ala Glu Gly Asp Asn He Glu Phe Gly Ala 
305 310 315 320 

Phe Val Gly His Arg Asp Ser Met Asp Leu Gin Arg Phe Lys Glu Thr 

325 330 335 

Ser Asn Lys He Arg Glu Leu Leu Ser Asn Asp Thr Pro Glu Asn Thr 

340 345 350 

Leu Lys His Val Gly Ala Ala Gly Tyr Ser Glu Cys Cys Lys Thr Ser 

355 360 365 

Thr Ser Leu His Ser Val Gin Ala Glu Ser Cys Ser Arg Arg Ala Ser 

370 375 380 

Thr Glu Asp Ser Pro Glu Val Asp Ser Lys Ala Ala Leu Leu Pro Asp 
385 390 395 400 

Trp Leu Arg Asp Arg Pro Ser Asn Arg Glu Met Pro Ser Glu Gly Gly 

405 410 415 

Thr Leu Asn Gly Leu Ala Ser Pro Phe Lys Pro Val Leu Asp Thr Asn 

420 425 430 

Tyr Tyr Tyr Ser Ala Val Glu Arg Asn Asn Leu Met Arg Leu Ser Gin 

435 440 445 

Ser He Pro Phe Val Pro Val Pro Pro Arg Gly Glu Pro Val Thr Val 

450 455 460 

Tyr Pro Ser Gly Gly Arg Val Leu Pro Val Tyr 
465 470 475 
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<210> 9 

<211> 5433 

<212> DNA 

<213> Mus Musculus 

<220> 

<221> unsure 

<222> (5094) . . . (5094) 

<223> unknown 



<400> 9 

ggctgaaaga gcctgagctg tgcctctcca ttccactgct gtggcagggt cagaaatctt 60 
ggatagagaa aaccttttgc aaacgggaat gtatctttgt aattcctagc acgaaagact 120 
ctaacaggtg ttgctgtggc cagttcacca accagcatat cccccctctg ccaagtgcaa 180 
cacccagcaa aaatgaagag gaaagcaaac aggtggagac tcagcctgag aaatggtctg 240 
ttgccaagca cacccagagc tacccaacag attcctatgg agttcttgaa ttccagggtg 300 
gcggatattc caataaagcc atgtatatcc gtgtatccta tgacaccaag ccagactcac 360 
tgctccatct catggtgaaa gattggcagc tggaactccc caagctctta atatctgtgc 420 
atggaggcct ccagaacttt gagatgcagc ccaagctgaa acaagtcttt gggaaaggcc 4 80 

tgatcaaggc tgctatgacc accggggcct ggatcttcac cgggggtgtc agcacaggtg 54 0 

ttatcagcca cgtaggggat gccttgaaag accactcctc caagtccaga ggccgggttt 600 
gtgctatagg aattgctcca tggggcatcg tggagaataa ggaagacctg gttggaaagg 660 

atgtaacaag agtgtaccag accatgtcca accctctaag taagctctct gtgctcaaca 720 
actcccacac ccacttcatc ctggctgaca atggcaccct gggcaagtat ggcgccgagg 780 
tgaagctgcg aaggctgctg gaaaagcaca tctccctcca gaagatcaac acaagactgg 840 
ggcagggcgt gcccctcgtg ggtctcgtgg tggagggggg ccctaacgtg gtgtccatcg 900 
tcttggaata cctgcaagaa gagcctccca tccctgtggt gatttgtgat ggcagcggac 960 

gtgcctcgga catcctgtcc tttgcgcaca agtactgtga agaaggcgga ataataaatg 1020 

agtccctcag ggagcagctt ctagttacca ttcagaaaac atttaattat aataaggcac 1080 

aatcacatca gctgtttgca attataatgg agtgcatgaa gaagaaagaa ctcgtcactg 1140 

tgttcagaat gggttctgag ggccagcagg acatcgagat ggcaatttta actgccctgc 1200 

tgaaaggaac aaacgtatct gctccagatc agctgagctt ggcactggct tggaaccgcg 1260 

tggacatagc acgaagccag atctttgtct ttgggcccca ctggacgccc ctgggaagcc 1320 

tggcaccccc gacggacagc aaagccacgg agaaggagaa gaagccaccc atggccacca 1380 

ccaagggagg aagaggaaaa gggaaaggca agaagaaagg gaaagtgaaa gaggaagtgg 1440 

aggaagaaac tgacccccgg aagatagagc tgctgaactg ggtgaatgct ttggagcaag 1500 

cgatgctaga tgctttagtc ttagatcgtg tcgactttgt gaagctcctg attgaaaacg 1560 

gagtgaacat gcaacacttt ctgaccattc cgaggctgga ggagctctat aacacaagac 1620 

tgggtccacc aaacacactt catctgctgg tgagggatgt gaaaaagagc aaccttccgc 1680 

ctgattacca catcagcctc atagacatcg ggctcgtgct ggagtacctc atgggaggag 1740 

cctaccgctg caactacact cggaaaaact ttcggaccct ttacaacaac ttgtttggac 1800 

caaagaggcc taaagctctt aaacttctgg gaatggaaga tgatgagcct ccagctaaag 1860 

ggaagaaaaa aaaaaaaaag aaaaaggagg aagagatcga cattgatgtg gacgaccctg 1920 

ccgtgagtcg gttccagtat cccttccacg agctgatggt gtgggcagtg ctgatgaaac 1980 

gccagaaaat ggcagtgttc ctctggcagc gaggggaaga gagcatggcc aaggccctgg 2040 

tggcctgcaa gctctacaag gccatggccc acgagtcctc cgagagtgat ctggtggatg 2100 

acatctccca ggacttggat aacaattcca aagacttcgg ccagcttgct ttggagttat 2160 

tagaccagtc ctataagcat gacgagcaga tcgctatgaa actcctgacc tacgagctga 2220 

aaaactggag caactcgacc tgcctcaaac tggccgtggc agccaaacac cgggacttca 2280 

ttgctcacac ctgcagccag atgctgctga ccgatatgtg gatgggaaga ctgcggatgc 234 0 

ggaagaaccc cggcctgaag gttatcatgg ggattcttct accccccacc atcttgtttt 24 00 

tggaatttcg cacatatgat gatttctcgt atcaaacatc caaggaaaac gaggatggca 2 4 60 

aagaaaaaga agaggaaaat acggatgcaa atgcagatgc tggctcaaga aagggggatg 2 520 

aggagaacga gcataaaaaa cagagaagta ttcccatcgg aacaaagatc tgtgaattct 2580 

ataacgcgcc cattgtcaag ttctggtttt acacaatatc atacttgggc tacctgctgc 2 640 

tgtttaacta cgtcatcctg gtgcggatgg atggctggcc gtccctccag gagtggatcg 2700 

tcatctccta catcgtgagc ctggcgttag agaagatacg agagatcctc atgtcagaac 2760 

caggcaaact cagccagaaa atcaaagttt ggcttcagga gtiactggaac atcacagatc 2820 

tcgtggccat ttccacattc atgattggag caattcttcg cctacagaac cagccctaca 2880 
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tgggctatgg ccgggtgatc tactgtgtgg atatcatctt ctggtacatc cgtctcctgc 
acatctttgg tgtcaacaag tatctgggcc catacgtgat gatgattgga aagangatga 
tcgacatgct gtactttgtg gtcatcatgc tggtcgtgct catgagtttc ggagtagccc 
gtcaagccat tctgcatcca gaggagaagc cctcttggaa actggcccga aacatcttct 
acatgcccta ctggatgatc tatggagagg tgtttgcaga ccagatagac ctctacgcca 
tggaaattaa tcctccttgt ggtgagaacc tatatgatga ggagggcaag cggcttcctc 
cctgtatccc cggcgcctgg ctcactccag cactcatggc gtgctatcta ctggtcgcca 
acatcctgct ggtgaacctg ctgattgctg tgttcaacaa tactttcttt gaagtaaaat 
caatatccaa ccaggtgtgg aagttccagc gatatcagct gattatgaca tttcatgaca 
ggccagtcct gcccccaccg atgatcattt taagccacat ctacatcatc attatgcgtc 
tcagcggccg ctgcaggaaa aagagagaag gggaccaaga ggaacgggat cgrggattga 
agctcttcct tagcgacgag gagctaaaga ggctgcatga gttcgaggag cagtgcgtgc 
aggagcactt ccgggagaag gaggatgagc agcagtcgtc cagcgacgag cgcatccggg 
tcacttctga aagagttgaa aatatgtcaa tgaggttgga agaaatcaat gaaagagaaa 

cttttatgaa aacttccctg cagactgttg accttcgact tgctcagcta gaagaattat 
ctaacagaat ggtgaatgct cttgaaaatc ttgcgggaat cgacaggtct gacctgatcc 

aggcacggtc ccgggcttct tctgaatgtg aggcaacgta tcttctccgg caaagcagca 
tcaatagcgc tgatggctac agcttgtatc gatatcattt taacggagaa gagttattat 
ttgaggatac atctctctcc acgtcaccag ggacaggagt caggaaaaaa acctgttcct 
tccgtataaa ggaagagaag gacgtgaaaa cgcacctagt cccagaatgt cagaacagtc 
ttcacctttc actgggcaca agcacatcag caaccccaga tggcagtcac cttgcagtag 
atgacttaaa gaacgctgaa gagtcaaaat taggtccaga tattgggatt tcaaaggaag 
atgatgaaag acagacagac tctaaaaaag aagaaactat ttccccaagt ttaaataaaa 
cagatgtgat acatggacag gacaaatcag atgttcaaaa cactcagcta acagtggaaa 
cgacaaatat agaaggcact atttcctatc ccctggaaga aaccaaaatt acacgctatt 
tccccgatga aacgatcaat gcttgtaaaa caatgaagtc cagaagcttc gtctattccc 
ggggaagaaa gctggtcggt ggggttaacc aggatgtaga gtacagttca atcacggacc 
agcaattgac gacggaatgg caatgccaag ttcaaaagat cacgcgctct catagcacag 
atattcctta cattgtgtcg gaagctgcag tgcaagctga gcaaaaagag cagtttgcag 
atatgcaaga tgaacaccat gtcgctgaag caattcctcg aatccctcgc ttgtccctaa 
ccattactga cagaaatggg atggaaaact tactgtctgt gaagccagat caaactttgg 
gattcccatc tctcaggtca aaaagtttac atggacatcc taggaatgtg aaatccattc 
agggaaagtt agacagatct ggacatgcca gtagtgtaag cagcttagta attgtgtctg 
gaatgacagc agaagaaaaa aaggttaaga aagagaaagc ttccacagaa actgaatgct 
agtctgtttt gtttctttaa tttttttttt taacagtcag aaacccacta atgggtgtca 
tcttggccca tcctaaacac atmtccaatt tcctaaaaac attttccctt aaaaaatttt 
ggaaattcag acttgattta caatttaatg cactaaaagt agtattttgt tagnatatgt 
tagtaggctt agttttttca gttgcagtag tatcaaatga aagtgatgat actgtaacga 
agataaattg gctaatcagt atacaagatt atacaatctc tttattactg agggccacca 
aatagcctag gaagtgccct cgagcactga agtcaccatt aggtcactca agaagtaagc 
aactagctgg gcacagtggc tcatgcctgt aatcctagca ctttgggagg ccaaggcaga 
aagatagctt gagtccagga gtttgagacc agcctgggca acatagtgat accccatctc 
ttaaaaaaaa aaaaaaaaaa ctgccctcgt gcc 
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3000 
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3300 
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5433 
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Thr Lys Pro Asp Ser Leu Leu His 

10 15 
Glu Leu Pro Lys Leu Leu lie Ser 
25 30 
Glu Met Gin Pro Lys Leu Lys Gin 
45 

Ala Ala Met Thr Thr Gly Ala Trp 
60 
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HOD 










440 










445 








Lys 


oer 


Asn 


Leu 


Pro 


Pro 


Asp 


Tyr 


His 


He 


Ser 


Leu 


He 


Asp 


He 


Glv 




450 










455 










460 










Leu 


Val 


Leu 


Glu 


Tyr 


Leu 


Met 


Gly 


Gly 


Ala 


Tyr 


Arg 


Cys 


Asn 


Tyr 


Thr 


465 










470 










475 










480 


Arg 


Lys 


Asn 


Phe 


Arg 


Thr 


Leu 


Tyr 


Asn 


Asn 


Leu 


Phe 


Gly 


Pro 


Lys 


Arg 










485 










490 










495 




Pro 


Lys 


Ala 


Leu 


Lys 


Leu 


Leu 


Gly 


Met 


Glu 


Asp 


Asp 


Glu 


Pro 


Pro 


Ala 






500 










505 










510 






Lys 


Gly 


Lys 


Lys 


Lys 


Lys 


Lys 


Lys 


Lys 


Lys 


Glu 


Glu 


Glu 


He 


Asp 


He 






515 










520 










525 








Asp 


Val 


Asp 


Asp 


Pro 


Ala 


Val 


Ser 


Arg 


Phe 


Gin 


Tyr 


Pro 


Phe 


His 


Glu 




530 










535 










540 
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Leu 


Met 


Val 


Trp 


Ala 


Val 


Leu 


Met 


Lys 


Arg 


Gin 


Lys 


Met 


Ala 


Val 


Phe 


545 










550 










555 










560 


Leu 


Trp 


Gin 


Arg 


Gly 


Glu 


Glu 


Ser 


Met 


Ala 


Lys 


Ala 


Leu 


Val 


Ala 


Cys 










565 










570 










575 




Lys 


Leu 


Tyr 


Lys 


Ala 


Met 


Ala 


His 


Glu 


Ser 


Ser 


Glu 


Ser 


Asp 


Leu 


Val 








580 










585 










590 






Asp Asp 


He 


Ser 


Gin 


Asp 


Leu 


Asp 


Asn 


Asn 


Ser 


Lys 


Asp 


Phe 


Gly 


Gin 






595 










600 










605 








Leu 


Ala 


Leu 


Glu 


Leu 


Leu 


Asp 


Gin 


Ser 


Tyr 


Lys 


His 


Asp 


Glu 


Gin 


He 




610 










615 










620 










Ala 


Met 


Lys 


Leu 


Leu 


Thr 


Tyr 


Glu 


Leu 


Lys 


Asn 


Trp 


Ser 


Asn 


Ser 


Thr 


625 










630 










635 










640 


Cys 


Leu 


Lys 


Leu 


Ala 


Val 


Ala 


Ala 


Lys 


His 


Arg 


Asp 


Phe 


He 


Ala 


His 










645 










650 










655 




Thr 


Cys 


Ser 


Gin 


Met 


Leu 


Leu 


Thr 


Asp 


Met 


Trp 


Met 


Gly Arg 


Leu 


Arg 








660 










665 










670 






Met 


Arg 


Lys 


Asn 


Pro 


Gly 


Leu 


Lys 


Val 


He 


Met 


Gly 


He 


Leu 


Leu 


Pro 






675 










680 










685 








Pro 


Thr 


He 


Leu 


Phe 


Leu 


Glu 


Phe 


Arg 


Thr 


Tyr 


Asp 


Asp 


Phe 


Ser 


Tyr 




690 










695 










700 










Gin 


Thr 


Ser 


Lys 


Glu 


Asn 


Glu 


Asp 


Gly 


Lys 


Glu 


Lys 


Glu 


Glu 


Glu 


Asn 


705 










710 










715 










720 


Thr Asp 


Ala 


Asn 


Ala 


Asp 


Ala 


Gly 


Ser 


Arg 


Lys 


Gly Asp 


Glu 


Glu 


Asn 










725 










730 










735 




Glu 


His 


Lys 


Lys 


Gin 


Arg 


Ser 


He 


Pro 


He 


Gly 


Thr 


Lys 


He 


Cys 


Glu 








740 










745 










750 






Phe 


Tyr 


Asn 


Ala 


Pro 


He 


Val 


Lys 


Phe 


Trp 


Phe 


Tyr 


Thr 


He 


Ser 


Tyr 




755 










760 










765 








Leu Gly 


Tyr 


Leu 


Leu 


Leu 


Phe 


Asn 


Tyr 


Val 


He 


Leu 


Val 


Arg 


Met 


Asp 




770 








775 










780 










Gly 


Trp 


Pro 


Ser 


Leu 


Gin 


Glu 


Trp 


He 


Val 


He 


Ser 


Tyr 


He 


Val 


Ser 


785 










790 










795 










800 


Leu 


Ala 


Leu 


Glu 


Lys 


He 


Arg 


Glu 


He 


Leu 


Met 


Ser 


Glu 


Pro 


Gly 


Lys 










805 










810 










815 




Leu 


Ser 


Gin 


Lys 


He 


Lys 


Val 


Trp 


Leu 


Gin 


Glu 


Tyr 


Trp 


Asn 


He 


Thr 








820 










825 










830 






Asp 


Leu 


Val 


Ala 


He 


Ser 


Thr 


Phe 


Met 


He 


Gly Ala 


He 


Leu Arg 


Leu 






835 










840 










845 








Gin 


Asn 


Gin 


Pro 


Tyr 


Met 


Gly 


Tyr 


Gly 


Arg 


Val 


He 


Tyr 


Cys 


Val 


Asp 




850 










855 










860 










He 


He 


Phe 


Trp 


Tyr 


He 


Arg 


Val 


Leu 


Asp 


He 


Phe 


Gly Val Asn 


Lys 


865 










870 










875 










880 


Tyr 


Leu 


Gly 


Pro 


Tyr 


Val 


Met 


Met 


He 


Gly 


Lys 


Met 


Met 


He 


Asp Met 










885 










890 










895 




Leu 


Tyr 


Phe 


Val 


Val 


He 


Met 


Leu 


Val 


Val 


Leu 


Met 


Ser 


Phe 


Gly 


Val 






900 










905 










910 






Ala 


Arg 


Gin 


Ala 


He 


Leu 


His 


Pro 


Glu 


Glu 


Lys 


Pro 


Ser 


Trp 


Lys 


Leu 






915 










920 










925 








Ala 


Arg 


Asn 


He 


Phe 


Tyr 


Met 


Pro 


Tyr 


Trp 


Met 


He 


Tyr 


Gly 


Glu 


Val 




930 










935 










940 










Phe 


Ala 


Asp 


Gin 


He 


Asp 


Leu 


Tyr 


Ala 


Met 


Glu 


He 


Asn 


Pro 


Pro 


Cys 


945 










950 










955 










960 


Gly 


Glu 


Asn 


Leu 


Tyr 


Asp 


Glu 


Glu 


Gly 


Lys 


Arg 


Leu 


Pro 


Pro 


Cys 


He 










965 










970 










975 




Pro 


Gly 


Ala 


Trp 


Leu 


Thr 


Pro 


Ala 


Leu 


Met 


Ala 


Cys 


Tyr 


Leu 


Leu 


Val 








980 










985 










990 






Ala 


Asn 


He 


Leu 


Leu 


Val 


Asn 


Leu 


Leu 


He 


Ala 


Val 


Phe 


Asn 


Asn 


Thr 






995 










1000 








1005 






Phe 


Phe 


Glu 


Val 


Lys 


Ser 


He 


Ser 


Asn 


Gin 


Val 


Trp 


Lys 


Phe 


Gin 


Arg 




1010 








1015 








1020 
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Tyr Gin Leu lie Met Thr Phe His Asp Arg Pro Val Leu Pro Pro Pro 
1025 1030 1035 104 

Met lie lie Leu Ser His lie Tyr lie lie lie Met Arg Leu Ser Gly 

1045 1050 1055 

Arg Cys Arg Lys Lys Arg Glu Gly Asp Gin Glu Glu Arg Asp Arg Gly 

1060 1065 1070 

Leu Lys Leu Phe Leu Ser Asp Glu Glu Leu Lys Arg Leu His Glu Phe 

1075 1080 1085 

Glu Glu Gin Cys Val Gin Glu His Phe Arg Glu Lys Glu Asp Glu Gin 

1090 1095 1100 

Gin Ser Ser Ser Asp Glu Arg lie Arg Val Thr Ser Glu Arg Val Glu 
1105 1110 1115 112 

Asn Met Ser Met Arg Leu Glu Glu lie Asn Glu Arg Glu Thr Phe Met 

1125 1130 1135 

Lys Thr Ser Leu Gin Thr Val Asp Leu Arg Leu Ala Gin Leu Glu Glu 

1140 1145 1150 

Leu Ser Asn Arg Met Val Asn Ala Leu Glu Asn Leu Ala Gly lie Asp 

1155 1160 1165 

Arg Ser Asp Leu lie Gin Ala Arg Ser Arg Ala Ser Ser Glu Cys Glu 

1170 1175 1180 

Ala Thr Tyr Leu Leu Arg Gin Ser Ser lie Asn Ser Ala Asp Gly Tyr 
1185 1190 1195 120 

Ser Leu Tyr Arg Tyr His Phe Asn Gly Glu Glu Leu Leu Phe Glu Asp 

1205 1210 1215 

Thr Ser Leu Ser Thr Ser Pro Gly Thr Gly Val Arg Lys Lys Thr Cys 

1220 1225 1230 

Ser Phe Arg lie Lys Glu Glu Lys Asp Val Lys Thr His Leu Val Pro 

1235 1240 1245 

Glu Cys Gin Asn Ser Leu His Leu Ser Leu Gly Thr Ser Thr Ser Ala 

1250 1255 1260 

Thr Pro Asp Gly Ser His Leu Ala Val, Asp Asp Leu Lys Asn Ala Glu 
1265 1270 ' 1275 128 

Glu Ser Lys Leu Gly Pro Asp lie Gly lie Ser Lys Glu Asp Asp Glu 

1285 1290 1295 

Arg Gin Thr Asp Ser Lys Lys Glu Glu Thr lie Ser Pro Ser Leu Asn 

1300 1305 1310 

Lys Thr Asp Val lie His Gly Gin Asp Lys Ser Asp Val Gin Asn Thr 

1315 1320 1325 

Gin Leu Thr Val Glu Thr Thr Asn He Glu Gly Thr He Ser Tyr Pro 

1330 1335 1340 

Leu Glu Glu Thr Lys He Thr Arg Tyr Phe Pro Asp Glu Thr He Asn 
1345 1350 1355 136 

Ala Cys Lys Thr Met Lys Ser Arg Ser Phe Val Tyr Ser Arg Gly Arg 

1365 1370 1375 

Lys Leu Val Gly Gly Val Asn Gin Asp Val Glu Tyr Ser Ser He Thr 

1380 1385 1390 

Asp Gin Gin Leu Thr Thr Glu Trp Gin Cys Gin Val Gin Lys He Thr 

1395 • 1400 1405 

Arg Ser His Ser Thr Asp He Pro Tyr He Val Ser Glu Ala Ala Val 

1410 1415 1420 

Gin Ala Glu Gin Lys Glu Gin Phe Ala Asp Met Gin Asp Glu His His 
1425 1430 1435 144 

Val Ala Glu Ala He Pro Arg He Pro Arg Leu Ser Leu Thr He Thr 

1445 1450 1455 

Asp Arg Asn Gly Met Glu Asn Leu Leu Ser Val Lys Pro Asp Gin Thr 

1460 1465 1470 

Leu Gly Phe Pro Ser Leu Arg Ser Lys Ser Leu His Gly His Pro Arg 

1475 1480 1485 

Asn Val Lys Ser He Gin Gly Lys Leu Asp Arg Ser Gly His Ala Ser 
1490 1495 1500 
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Ser Val Ser Ser Leu Val lie Val Ser Gly Met Thr Ala Glu Glu Lys 
1505 1510 1515 152 

Lys Val Lys Lys Glu Lys Ala Ser Thr Glu Thr Glu Cys 
1525 1530 

<210> 11 

<211> 6220 

<212> DNA 

<213> Homo Sapiens 



<400 
tgtgcagaat 
ttccctgtgc 
tgcagtcacc 
actcccccag 
gtgcaggctg 
ctctcagaac 
tctctagaac 
ttgcagctgg 
gcaggaggag 
tctccggcgc 
caatgacaag 
cgtgtatttt 
ctacacgcat 
gtgggaccca 
cacgggcctg 
cgtgatctac 
ggtgaccggg 
aggcctggtc 
cggcgtcatg 
ggaaggcgag 
cctgatccat 
gaacctgacc 
cggccagtac 
gaccaaggaa 
cggcccgggc 
tgtggagggc 
ggacatcact 
gaccttcacg 
gcggcagctg 
catcttgcag 
ggaccaccag 
cttcatggat 
catctccaac 
gtttgtcacc 
ccacagcaag 
ggcgccccgc 
cacgcagccg 
tccccacgtc 
aggccatgtg 
gaaccgtcgg 
cttggcctgc 
ggaggagatg 
gtgctaccgg 
ggggaagacc 
cgggggcatc 
gctgtggcgt 
cttcagggag 
caccgcaccc 
gttcgcctac 



> 11 

tgtacagttg 
ggtgctcagt 
tgaggttgtt 
agtgtccggg 
atgaagagaa 
atcagcctca 
cccagtgtag 
cgtgggggtc 
ggctttgagg 
agcaacagca 
caagaaagcc 
gtggaaagtt 
gagcagcact 
aagaaacatg 
agccagaagg 
cacctcatga 
ggggccaaga 
aaggtggctc 
aagcaggtag 
ctcatcacca 
cccacgggca 
tgcctagaca 
ggggtggaga 
agaggaggtg 
acgttgcaca 
tcgggccgcg 
atctccctga 
gaaagcagga 
ctgactgtct 
gccttgctga 
ctgaaactgg 
gagtggcagt 
aagcctgagt 
tgggacacct 
ctgcaaaagg 
ctgcagatgc 
ctttatcccc 
aagctcaacg 
accttcacca 
gagctggcag 
agcaagatcc 
ctggcgctgg 
aaggacgaag 
acctgcctgc 
caggccttcc 
gtgaccctgt 
aagaggctgc 
gtggtggtct 
gtgctcatgg 



cgaaaccatg 
gcatctgcac 
accattatga 
ggccccagcc 
ccggatgagg 
tcttcctgtc 
cgagctggag 
tcagaatgga 
ggctgcccag 
gcctcttcaa 
tcagttcgtg 
ccaaactgtc 
tggaggaggc 
tccaggagat 
tgaaaaagta 
cccagcactg 
acttcaacat 
agaccacagg 
gcgaggcggt 
tcggagtcgc 
gcttccccgc 
gcaaccactc 
ttcctctgag 
tggccatcaa 
ccatcgacaa 
tggccgacgt 
tccagcagaa 
ttgtcgagtg 
tccgggaagg 
aagcctcacg 
cagtggcatg 
ggaagccttc 
ttgtgaagct 
tgctctacct 
tgctggtgga 
accacgtggc 
ggccccggca 
tgcagggagt 
tggaccccat 
gaatcatctg 
tgaaggaact 
cggaggagta 
agagagccca 
agctcgccct 
tgaccaaggt 
gcatgctggc 
aggatgtggg 
tccacctgaa 
tggacttcca 



tcgctggcag 
ccgtggggga 
acggccgctg 
aagggacaca 
gcttcacatg 
tctgatctat 
agaggactgt 
gccctcagcc 
aagggtcact 
gagctggagg 
gattcctgaa 
tgatgctggg 
taccaagccc 
gccaaccgat 
cgtccgagtc 
ggggctggac 
gaagccgcgg 
ggcctggatc 
gcgggacttc 
cacctggggc 
cgagtacata 
tcacttcatc 
gaccaggctg 
gatccccatc 
cgccaccacc 
cattgcccag 
actgagcgtg 
gaccaaaaag 
caaggatggt 
gagccaagac 
gaatcgcgtg 
agatctgcac 
cttcctggaa 
gtacgagaac 
ggatcccgag 
ccaggtgctg 
caacgaccgg 
gagcctccgg 
ccgtgacctt 
ggctcagagc 
gtccaaggag 
tgagcacaga 
gaaactgctc 
ggaggccaag 
gtggtggggc 
cttcccgctg 
cacccccgcg 
catcctctcc 
gcctgtgccc 



ctggtgctgg 
gggagctctt 
ggacccccgc 
tctcacgcag 
aggaagcatg 
ttcaccaacc 
cctgagggca 
ctgaggaaag 
gacctgggga 
ctacagtgcc 
aacatcaaga 
aaggtggtgt 
cacaccttcc 
gcctttggcg 
tcccaggaca 
gtccccaatc 
ctgaagagca 
atcacagggg 
agcctgagca 
actgtccacc 
ctggatgagg 
ctcgtggacg 
gagaagttca 
gtgtgcgtgg 
aacggcaccc 
gtggccaacc 
ttcttccagg 
atccaagata 
cagcaggacg 
cactttggcc 
gacattgccc 
cccacgatga 
aacggggtgc 
ctggacccct 
cgcccggctt 
cgggagctgc 
ctgcggctcc 
tccctctaca 
ctcatttggg 
caggactgca 
gaggaggaca 
gccatcgggg 
acccgcgtgt 
gacatgaagt 
cagctctccg 
ctcctcaccg 
gcccgcgccc 
tacttcgcct 
tcctggtgcg 



cggtggagac 
tctctggccc 
atgtgcatgt 
ctgggaacat 
tggccaggtc 
accccatgtg 
gcaggcctgg 
ctggctcgga 
tggtctccaa 
ccttcggcaa 
agaaagaatg 
gtcagtgtgg 
agggcacaca 
acatcgtctt 
cgccctccag 
tcttgatctc 
ttttccgcag 
ggtcccacac 
gcagctacaa 
gccgcgaggg 
atggccaagg 
acgggaccca 
tatcggagca 
tgctggaggg 
cctgtgtggt 
tgcctgtctc 
agatgtttga 
ttgtccggag 
tggatgtggc 
acgagaactg 
gcagtgagat 
cagctgcact 
agctgaagga 
cctgcctgtt 
gcgcgcccgc 
tgggggactt 
tgctgcccgt 
agcgttcctc 
ccattgtcca 
tcgcagcggc 
cggacagctc 
tcttcaccga 
ccgaggcctg 
ttgtgtctca 
tggacaatgg 
gcctcatctc 
gtgccttctt 
tcctctgcct 
agtgtgccat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
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ctacctctgg ctcttctcct tggtgtgcga ggagatgcgg cagctcttct atgaccctga 3000 

cgagtgcggg ctgatgaaga aggcagcctt gtacttcagt gacttctgga ataagctgga 3060 

cgtcggcgca atcttgctct tcgtggcagg gctgacctgc aggctcatcc cggcgacgct 3120 

gtaccccggg cgcgtcatcc tctctctgga cttcatcctg ttctgcctcc ggctcatgca 3180 

catttttacc atcagtaaga cgctggggcc caagatcatc attgtgaagc ggatgatgaa 3240 

ggacgtcttc ttcttcctct tcctgctggc tgtgtgggtg gtgtccttcg gggtggccaa 3300 

gcaggccatc ctcatccaca acgagcgccg ggtggactgg ctgttccgag gggccgtcta 3360 

ccactcctac ctcaccatct tcgggcagat cccgggctac atcgacggtg tgaacttcaa 3420 

cccggagcac tgcagcccca atggcaccga cccctacaag cctaagtgcc ccgagagcga 34 80 

cgcgacgcag cagaggccgg ccttccctga gtggctgacg gtcctcctac tctgcctcta 3540 

cctgctcttc accaacatcc tgctgctcaa cctcctcatc gccatgttca actacacctt 3600 

ccagcaggtg caggagcaca cggaccagat ttggaagttc cagcgccatg acctgatcga 3660 

ggagtaccac ggccgccccg ccgcgccgcc ccccttcatc ctcctcagcc acctgcagct 3720 

cttcatcaag agggtggtcc tgaagactcc ggccaagagg cacaagcagc tcaagaacaa 37 80 

gctggagaag aacgaggagg cggccctgct atcctgggag atctacctga aggagaacta 3840 

cctccagaac cgacagttcc agcaaaagca gcggcccgag cagaagatcg aggacatcag 3900 

caataaggtt gacgccatgg tggacctgct ggacctggac ccactgaaga ggtcgggctc 3960 

catggagcag aggttggcct ccctggagga gcaggtggcc cagacagccc gagccctgca 4020 

ctggatcgtg aggacgctgc gggccagcgg cttcagctcg gaggcggacg tccccactct 4 080 

ggcctcccag aaggccgcgg aggagccgga tgctgagccg ggaggcagga agaagacgga 414 0 

ggagccgggc gacagctacc acgtgaatgc ccggcacctc ctctacccca actgccctgt 4200 

cacgcgcttc cccgtgccca acgagaaggt gccctgggag acggagttcc tgatctatga 4260 

cccacccttt tacacggcag agaggaagga cgcggccgcc atggacccca tgggagacac 4320 

cctggagcca ctgtccacga tccagtacaa cgtggtggat ggcctgaggg accgccggag 4 380 

cttccacggg ccgtacacag tgcaggccgg gttgcccctg aaccccatgg gccgcacagg 4440 

actgcgtggg cgcgggagcc tcagctgctt cggacccaac cacacgctgt accccatggt 4500 

cacgcggtgg aggcggaacg aggatggagc catctgcagg aagagcataa agaagatgct 4560 

ggaagtgctg gtggtgaagc tccctctctc cgagcactgg gccctgcctg ggggctcccg 4 620 

ggagccaggg gagatgctac ctcggaagct gaagcggatc ctccggcagg agcactggcc 4 680 

gtcttttgaa aacttgctga agtgcggcat ggaggtgtac aaaggctaca tggatgaccc 4740 

gaggaacacg gacaatgcct ggatcgagac ggtggccgtc agcgtccact tccaggacca 4800 

gaatgacgtg gagctgaaca ggctgaactc taacctgcac gcctgcgact cgggggcctc 4 860 

catccgatgg caggtggtgg acaggcgcat cccactctat gcgaaccaca agaccctcct 4 920 

ccagaaggca gccgctgagt tcggggctca ctactgactg tgccctcagg ctgggcggct 4980 

ccagtccata gacgttcccc ccagaaacca gggcttctct ctcctgagcc tggccaggac 5040 

tcaggctgtt cctgggccct gcacatgatg gggtttggtg gacccagtgc ccctcacggc 5100 

tgccgcaagt ctgctgcaga tgacctcatg aactggaagg ggtcaaggtg acccgggagg 5160 

agagctcaag acagggcaca ggctactcag agctgagggg cccctgggac ccttggccat 5220 

caggcgaggg gctgggcctg tgcagctggg cccttggcca gagtccactc ccttcctggc 5280 

tgtgtcaccc cgagcagctc atccaccatg gaggtcattg gcctgaggca agttccccgg 5340 

agagtcggga tcccctgtgg ccccctcagg cctatgtctg tgaggaaggg gccctgccac 5400 

tctccccaag agggcctcca tgtttcgagg tgcctcaaca tggagccttg cctggcctgg 54 60 

gctaggggca ctgtctgaac tcctgactgt caggataaac tccgtggggg tacaggagcc 5520 

cagacaaagc ccaggcctgt caagagacgc agagggcccc tgccagggtt ggccccaggg 5580 

accctgggac gaggctgcag aagctctccc tccctactcc ctgggagcca cgtgctggcc 5640 

atgtggccag ggacggcatg agcaggaggc ggggacgtgg gggccttctg gtttggtgtc 5700 

aacagctcac aggagcgtga accatgaggg ccctcaggag gggaacgtgg taaaacccaa 57 60 

gacattaaat ctgccatctc aggcctggct ggctcttctg tgctttccac aaataaagtt 5820 

cctgacacgt ccagggccag gggctgtgtg acggctgcct gaagttctcc tcgatccccc 5880 

ggtgagcttc ctgcagcctg tggatgtcct gcagcccctc agccctaccc ccaagtttct 5 94 0 

cctctgaccc atcagctccc tgtcttcatt ttcctaaacc tgggctccag catcgtcccc 6000 

aagcccacca ggccaggatg caggcatcca catgccctcc tccttggctt cccctgcgtg 6060 

gtggtgccaa tgtgccctgg cacccctgca gaggctccgg atggagcctg gggctgcctg 6120 

gccactgagc actggccgag gtgatgccca cccttccctg gacaggcctc tgtcttccac 6180 

ctgacccaaa gctctctagc cacccccttg tccccagtat 6220 
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<212> PRT 

<213> Homo Sapiens 



<400> 12 







Pro 


Ser 


Ala 


Leu 


Arg 


Lys 


Ala 


Gly 


Ser 


Glu 


Gin 


Glu 


Glu Gly 


1 








5 










10 










15 




irns 


uXU 


i3 j.y 


Leu 


Pro 


Arg 


Arg 


Val 


Thr 


Asp 


Leu 


Gly 


Met 


Val 


Ser 


Asn 






20 










25 










30 






Leu 


Arg 


AXC( 


OCX 


Asn 


Ser 


Ser 


Leu 


Phe 


Lys 


Ser 


Trp 


Arg 


Leu 


Gin 


Cys 




35 










40 










45 








Pro 


Pne 


Gly 


Asn 


Asn 


Asp 


Lys 


Gin 


Glu 


Ser 


Leu 


Ser 


Ser 


Tro 


He 


Pro 




50 








55 










60 










Glu 


Asn 


lie 


Lys 


Lys 


Lys 


r:i n 

V3X U 


Cys 


Val 


I yx 


Phe 


Val 


Glu 


Ser 


Ser 


Lys 


65 








70 










75 










O V 


Leu 


Ser 


Asp 


A±a 


o± y 


Lys 


vcix 


Val 


Cys 


Gin 


Cys 


Gly Tyr 


Thr 


His 


Glu 








o ^ 










90 














Gin 


His 


Leu 


Glu 
100 


Glu 




1 nr 


Lys 


Pro 
105 


His 


Thr 


Phe 


Gin 


Glv 

110 


rri Vi 

1 nr 


isxn 


Trp 


Asp 


Pro 


Lys 


Lys 


riXS 


Va 1 
Va J. 


Gin 


Glu 


Met 


Pro 


Thr 


Asp 


Ala 


XT lie 


oxy 


115 










120 










125 








Asp 


He 


Val 


Phe 


Thr 


Gly 


Leu 


Ser 


Gin 


Lys 


Va 1 
Vox 


Lys 


Lys 


lyr 


Vo 1 

vax 


Arg 


130 




















140 










Val 


Ser 


Gin 


Asp 


Thr 


fro 


Ser 


Ser 


Val 


T 1 o 
X xc 


Tyr 


His 


Leu 


Met 


i. 1 IX 


OXIl 


145 


















155 










1 fin 

X vU 


His 


Trp 


Gly 


Leu 


Asp 


vax 


irro 


Asn 


Leu 


Leu 


He 


Ser 


Val 


Thr 


Glv 


Glv 






165 










170 










X / 




Ala 


Lys 


Asn 


fne 


Asn 




T 

Lys 


Pro Arg 


Leu 


Lys 


Ser 


He 


Phe 


Arg 


Arg 






ion 










185 










190 






Gly Leu 


va J. 


Lys 


va J. 




Gin 


Thr 


Thr 


Glv 


Ala 


Trp 


X xe 


He 


Thr 


Glv 














200 


















Gly Ser 


His 


rp l_ ~. 

inr 


oj.y 




Irlkz L. 


Lys 


Gin 


Val 


Glv 


Glu 


Ala 


Val 


/4xtJ 


Asp 




210 










215 










o on 










Phe 


Ser 


Leu 


Cat* 


o c 1. 


OcX. 


lyjT 


Lys 


Glu 


Glv 


Glu 


Leu 


He 


Thr 


He 


Glv 


225 










230 










235 










240 


Val 


Ala 


rp V", f 

i nr 


Trp 


oxy 




Val 


His 


T^g 


Arg 


Glu 


Gly 


Leu 


He 


His 


Pro 








245 










250 










255 




Thr 


Gly 


Ser 




Pro 


Ala 


Glu 


Tyr 


He 


Leu 


ASD 


Glu 


Asp 


Gly 


Gin 


Gly 






^ DU 










265 










270 






Asn 


Leu 




Cy s 


Leu 


Asp 


Ser 


Asn 


His 


Ser 


His 


Phe 


He 


Leu 


Val 


Asp 






27 5 








280 










285 








Asp 


Gly 


X nr 


nx 5 




Gin 


Twr 


Gly 


Val 


Glu 


He 


Pro 


Leu 


Arg 


Thr 


Arg 


290 










295 










300 










Leu 


Glu 


Lys 


XT ne 


-L -L t; 


Ser 


Glu 


Gin 


Thr 


Lys 


Glu 


Arg 


Gly 


Gly 


Val 


Ala 


305 








310 










315 










320 


He 


Lys 


He 


Pro 


He 


Val 


Cys 


Val 


Val 


Leu 


Glu 


Gly 


Gly 


Pro 


Gly 


Thr 








325 










330 










335 




Leu 


His 


Thr 


He 


Asp 


Asn 


Ala 


Thr 


Thr 


Asn 


Gly 


Thr 


Pro 


Cys 


Val 


Val 








340 








345 










350 






Val 


Glu 


Gly 


Ser 


Gly 


Arg 


Val 


Ala 


Asp 


Val 


He 


Ala 


Gin 


Val 


Ala 


Asn 






355 










360 








Gin 
380 


365 








Leu 


Pro 
370 


Val 


Ser 


Asp 


He 


Thr 
375 


He 


Ser 


Leu 


He 


Gin 


Lys 


Leu 


Ser 


Val 


Phe 


Phe 


Gin 


Glu 


Met 


Phe 


Glu 


Thr 


Phe 


Thr 


Glu 


Ser 


Arg 


He 


Val 


385 










390 










395 










400 


Glu 


Trp 


Thr 


Lys 


Lys 


He 


Gin 


Asp 


He 


Val 


Arg 


Arg 


Arg 


Gin 


Leu 


Leu 








405 










410 










415 




Thr 


Val 


Phe 


Arg 
420 


Glu 


Gly 


Lys 


Asp 


Gly 
425 


Gin 


Gin 


Asp 


Val 


Asp 
430 


Val 


Ala 


He 


Leu 


Gin 


Ala 


Leu 


Leu 


Lys 


Ala 


Ser 


Arg 


Ser 


Gin 


Asp 


His 


Phe 


Gly 



435 440 445 
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His 


(aJ-U 


Asn 


Trp 


Asp 






Le u 


Lys 


Leu 


Ala 


Val 


Ala 




Asn 


Arg 
























4 60 










vai 


Asp 


1 jue 


AX a 


Arg 


Cat- 

oer 


r"i n 

V3±U 


T 1 o 

J. J. 6 






Asp 


Glu 


irp 


Gin 


irp 


Lys 


H DD 


















4 75 










480 


Pro 


Ser 


Asp 


Leu 


ill s 


Pro 


1 nr 




i. fix 




riX d 


L u 
eu 


He 


Ser 


Asn 


Lys 










4 ft 

ft 0 3 




















495 




Pro 


Glu 


fne 


va JL 


Lys 


Leu 


rne 


Leu 


uX U 


Asn 


K3±y 




VaX&l 


Leu 


Lys 


ux u 




























RIO 






Phe 


Val 


Thr 


Trp 


Asp 


Thr 


Leu 


Leu 


Tyr 


Leu 


Tyr 


UXU 


Asn 


Leu 


Asp 


Pro 
















u 










525 








Ser 


Cys 


Leu 


Phe 


His 


Ser 


Lys 


Leu 


Vain 


Lys 


va± 


Leu 


va J. 


OXU 


Asp 


Pro 




530 










R "3 CL 




















Glu 


Arg 


Pro 


Ala 


Cys 


Ala 


Pro 


Ala 


Aia 


Pro 


Arg 


Leu 


K3±n 


nei. 


nxs 


nxs 


545 










c c rt 




















D DU 


Val 


Ala 


Gin 


Val 


Leu 


Arg 


Glu 


Leu 


Leu 


Gly 


Asp 


n _ 

fne 


Thr 


Liin 


Pro 


Leu 










565 










O iU 










O / D 




Tyr 


Prjo 


Arg 


Pro 


Arg 


His 


Asn 


Asp 


Arg 


Leu 


Arg 


Leu 


Leu 


Leu 


Pro 


Val 








580 










585 
















Pro 


His 


Val 


Lys 


Leu 


Asn 


Val 


Gin 


Gly 


Val 


Ser 


Leu 


Arg 


Ser 


Leu 


Tyr 






595 










600 










bU D 








Lys 


Arg 


Ser 


Ser 


Gly 


His 


Val 


Thr 


Phe 


Thr 


Met 


Asp 


Pro 


lie 


Arg 


Asp 




610 










615 










con 










Leu 


Leu 


He 


Trp 


Ala 


He 


Val 


Gin 


Asn 


Arg 


Arg 


Glu 


Leu 


Ala 


Gly 


He 


625 








630 










£ "3 C 
D Jo 










D4 u 


lie 


Trp 


Ala 


Gin 


Ser 


Gin 


Asp 


Cys 


He 


Ala 


Ala 


Ala 


Leu 


Ala 


Cys 


Ser 










645 










O50 










K>DO 




Lys 


He 


Leu 


Lys 


Glu 


Leu 


Ser 


Lys 


Glu 


Glu 


Glu 


Asp 


Thr 


Asp 


C A 

ser 


C A V- 

k>er 








660 




















670 






Glu 


Glu 


Met 


Leu 


Ala 


Leu 


Ala 


Glu 


Glu 


Tyr 


Glu 


His 


Arg 


Ala 


xie 


tjiy 
















DO u 










685 








Val 


Phe 


Thr 


Glu 


Cys 


Tyr 


Arg 


Lys 


Asp 


Glu 


Glu 


Arg 


Ala 


Gin 


Lys 


T Ami 

lieu 




690 




















*7nn 
/ uu 










Leu 


Thr 


Arg 


Val 


Ser 


Glu 


Ala 


Trp 


Gly 


Lys 


Thr 


Thr 


Cys 


Leu 


oin 


Leu 


705 










710 










T 1 ^ 
I XD 












Ala 


Leu 


Glu 


Ala 


Lys 


Asp 


Met 


Lys 


fne 


vai 


Ser 


Ills 


Gly Gly 


T 1 o 
XXe 


oxn 










725 










Tin 










f JU 




Ala 


Phe 


Leu 


Thr 


Lys 


Val 


Trp 


Trp 


Caly 


(jin 


Leu 


•ber 


Val 


Asp 


Asn 


iaxy 








n A n 




















750 






Leu 


Trp 


Arg 


Val 


Thr 


Leu 


Cys 


Met 


Leu 


A±a 


fne 


Pro 


Leu 


Leu 


Leu 


X fix 
















7 60 










765 








Gly 


Leu 


He 


Ser 


Phe 


Arg 


Glu 


Lys 


Arg 


Leu 


oin 


Asp 


Val 


Gly 


1 iii 


Pro 




~i ~t n 
/ 7 0 










1 f Zi 










7 R n 










Ala 


Ala 


Arg 


Ala 


Arg 


Ala 


Phe 


trne 


Thr 


Ala 


fro 


va± 


Val 


Val 


fne 


XIXS 


T D 
/ O J 






























800 


Leu 


Asn 


He 


Leu 


Ser 


Tyr 


Fne 


Ala 


xrne 


Leu 


Cys 


Leu 


Phe 


Ala 


Tyr 


V dX 










805 










D T n 
o ±u 










O X ^ 




Leu 


Met 


va± 


Asp 


Phe 


fin 


Pro 




Pro 


r 


Trp 


Cys 


Glu 


Cys 


Ala 


He 








820 










825 










830 






Tyr 


Leu 


Trp 


Leu 


Phe 


Ser 


Leu 


Val 


Cys 


Glu 


Glu 


Met 


Arg 


Gin 


Leu 


Phe 






835 










840 










845 








Tyr 


Asp 


Pro 


Asp 


Glu 


Cys 


Gly 


Leu 


Met 


Lys 


Lys 


Ala 


Ala 


Leu 


Tyr 


Phe 




850 










855 










860 










Ser 


Asp 


Phe 


Trp 


Asn 


Lys 


Leu 


Asp 


Val 


Gly 


Ala 


He 


Leu 


Leu 


Phe 


Val 


865 










870 










875 










880 


Ala 


Gly 


Leu 


Thr 


Cys 


Arg 


Leu 


He 


Pro 


Ala 


Thr 


Leu 


Tyr 


Pro 


Gly 


Arg 










885 










8 90 










895 




Val 


He 


Leu 


Ser 


Leu 


Asp 


Phe 


He 


Leu 


Phe 


Cys 


Leu 


Arg 


Leu 


Met 


His 








900 










905 










910 






He 


Phe 


Thr 


He 


Ser 


Lys 


Thr 


Leu 


Gly 


Pro 


Lys 


He 


He 


He 


Val 


Lys 



915 920 925 
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Arg Met Met Lys Asp Val Phe Phe Phe Leu Phe Leu Leu Ala Val Trp 

930 935 940 

Val Val Ser Phe Gly Val Ala Lys Gin Ala lie Leu lie His Asn Glu 
945 950 955 960 

Arg Arg Val Asp Trp Leu Phe Arg Gly Ala Val Tyr His Ser Tyr Leu 

965 970 975 

Thr lie Phe Gly Gin lie Pro Gly Tyr lie Asp Gly Val Asn Phe Asn 

980 985 990 

Pro Glu His Cys Ser Pro Asn Gly Thr Asp Pro Tyr Lys Pro Lys Cys 

995 1000 1005 

Pro Glu Ser Asp Ala Thr Gin Gin Arg Pro Ala Phe Pro Glu Trp Leu 

1010 1015 1020 

Thr Val Leu Leu Leu Cys Leu Tyr Leu Leu Phe Thr Asn lie Leu Leu 
1025 1030 1035 104 

Leu Asn Leu Leu lie Ala Met Phe Asn Tyr Thr Phe Gin Gin Val Gin 

1045 1050 1055 

Glu His Thr Asp Gin lie Trp Lys Phe Gin Arg His Asp Leu lie Glu 

1060 1065 1070 

Glu Tyr His Gly Arg Pro Ala Ala Pro Pro Pro Phe lie Leu Leu Ser 

1075 1080 1085 

His Leu Gin Leu Phe lie Lys Arg Val Val Leu Lys Thr Pro Ala Lys 

1090 1095 1100 

Arg His Lys Gin Leu Lys Asn Lys Leu Glu Lys Asn Glu Glu Ala Ala 
1105 1110 1115 112 

Leu Leu Ser Trp Glu lie Tyr Leu Lys Glu Asn Tyr Leu Gin Asn Arg 

1125 1130 1135 

Gin Phe Gin Gin Lys Gin Arg Pro Glu Gin Lys lie Glu Asp lie Ser 

1140 1145 1150 

Asn Lys Val Asp Ala Met Val Asp Leu Leu Asp Leu Asp Pro Leu Lys 

1155 1160 1165 

Arg Ser Gly Ser Met Glu Gin Arg Leu Ala Ser Leu Glu Glu Gin Val 

1170 1175 1180 

Ala Gin Thr Ala Arg Ala Leu His Trp lie Val Arg Thr Leu Arg Ala 
1185 1190 1195 120 

Ser Gly Phe Ser Ser Glu Ala Asp Val Pro Thr Leu Ala Ser Gin Lys 

1205 1210 1215 

Ala Ala Glu Glu Pro Asp Ala Glu Pro Gly Gly Arg Lys Lys Thr Glu 

1220 1225 1230 

Glu Pro Gly Asp Ser Tyr His Val Asn Ala Arg His Leu Leu Tyr Pro 

1235 1240 1245 

Asn Cys Pro Val Thr Arg Phe Pro Val Pro Asn Glu Lys Val Pro Trp 

1250 1255 1260 

Glu Thr Glu Phe Leu lie Tyr Asp Pro Pro Phe Tyr Thr Ala Glu Arg 
1265 1270 1275 128 

Lys Asp Ala Ala Ala Met Asp Pro Met Gly Asp Thr Leu Glu Pro Leu 

1285 1290 1295 

Ser Thr lie Gin Tyr Asn Val Val Asp Gly Leu Arg Asp Arg Arg Ser 

1300 1305 1310 

Phe His Gly Pro Tyr Thr Val Gin Ala Gly Leu Pro Leu Asn Pro Met 

1315 1320 1325 

Gly Arg Thr Gly Leu Arg Gly Arg Gly Ser Leu Ser Cys Phe Gly Pro 

1330 * 1335 1340 

Asn His Thr Leu Tyr Pro Met Val Thr Arg Trp Arg Arg Asn Glu Asp 
1345 1350 1355 136 

Gly Ala lie Cys Arg Lys Ser lie Lys Lys Met Leu Glu Val Leu Val 

1365 1370 1375 

Val Lys Leu Pro Leu Ser Glu His Trp Ala Leu Pro Gly Gly Ser Arg 

1380 1385 1390 

Glu Pro Gly Glu Met Leu Pro Arg Lys Leu Lys Arg lie Leu Arg Gin 
1395 1400 1405 



wo 00/40614 PCTAJS99/29996 

-18- 

Glu His Trp Pro Ser Phe Glu Asn Leu Leu Lys Cys Gly Met Glu Val 

1410 1415 1420 

Tyr Lys Gly Tyr Met Asp Asp Pro Arg Asn Thr Asp Asn Ala Trp lie 
1425 1430 1435 144 

Glu Thr Val Ala Val Ser Val His Phe Gin Asp Gin Asn Asp Val Glu 

1445 1450 1455 

Leu Asn Arg Leu Asn Ser Asn Leu His Ala Cys Asp Ser Gly Ala Ser 

1460 1465 1470 

lie Arg Trp Gin Val Val Asp Arg Arg lie Pro Leu Tyr Ala Asn His 

1475 1480 1485 

Lys Thr Leu Leu Gin Lys Ala Ala Ala Glu Phe Gly Ala His Tyr 
1490 1495 1500 
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<400> 13 



Met 


lie 


Thr 


Asp 


Lys 


Asn 


Leu 


Phe 


Ser 


Arg 


Leu 


Leu 


He 


Lys 


Lys 


Asn 


1 








5 










10 










15 




Pro 


lie 


Arg 


Met 


His 


Ser 


Pro 


Ser 


Phe 


Ser 


Phe 


Ser 


Leu 


He 


Thr 


Ser 






20 










25 










30 






Leu 


Phe 


Phe 


Thr 


Gin 


Phe 


Phe 


Met 


Phe 


Gin 


Leu 


Ser 


Ser 


Met 


Ala 


Tyr 






35 










40 










45 








Phe 


Phe 


Leu 


Thr 


Leu 


He 


Ala 


Gly 


Val 


Thr 


His 


Phe 


Tyr 


Phe 


Pro 


Glu 




50 










55 










60 










Lys 


Leu 


Leu 


Gly 


Lys 


Ser 


Glu 


Asn 


Leu 


Asp 


His 


Arg 


Tyr 


Gin 


Ser 


Ser 


65 










70 










75 










80 


LiXU 


(jxn 


Lys 


vaj. 


Leu 


xxe 




Trp 


Thr 


Glu 


Asn 


Lys 


Ala 


Val 


Ala 


Glu 










85 










90 










95 




Ser 


Leu 


Arg 


Ala 


Asn 


Ser 


Val 


Thr 


Val 


Glu 


Glu 


Asn 


Glu 


Ser 


Glu 


Arg 






100 










105 










110 






Glu 


Thr 


Glu 


Thr 


Gin 


Thr 


Lys 


Arg 


Arg 


Arg 


Lys 


Lys 


Gin 


Arg 


Ser 


Thr 






115 










120 










125 








Ser 


Ser 


Asp 


Lys 


Ala 


Pro 


Leu 


Asn 


Ser 


Ala 


Pro 


Arg 


His 


Val 


Gin 


Lys 




130 










135 










140 










Phe 


Asp 


Trp 


Lys 


Asp 


Met 


Leu 


His 


Leu 


Ala 


Asp 


He 


Ser 


Gly 


Arg 


Lys 


145 










150 










155 










160 


Arg 


Gly 


Asn 


Ser 


Thr 


Thr 


Ser 


His 


Ser 


Gly 


His 


Ala 


Thr 


Arg 


Ala 


Gly 










165 










170 










175 




Ser 


Leu 


Lys 


Gly 


Lys 


Asn 


Trp 


He 


Glu 


Cys 


Arg 


Leu 


Lys 


Met 


Arg 


Gin 








180 










185 










190 






Cys 


Ser 


Tyr 


Phe 


Val 


Pro 


Ser 


Gin 


Arg 


Phe 


Ser 


Glu 


Arg 


Cys 


Gly 


Cys 






195 










200 










205 








Gly 


Lys 


Glu 


Arg 


Ser 


Lys 


His 


Thr 


Glu 


Glu 


Val 


Leu 


Glu 


Arg 


Ser 


Gin 




210 










215 










220 










Asn 


Lys 


Asn 


His 


Pro 


Leu 


Asn 


His 


Leu 


Thr 


Leu 


Pro 


Gly 


He 


His 


Glu 


225 










230 










235 










240 


Val 


Asp 


Thr 


Thr 


Asp 


Ala 


Asp 


Ala 


Asp 


Asp 


Asn 


Glu 


Val 


Asn 


Leu 


Thr 










245 










250 










255 




Pro 


Gly 


Arg 


Trp 


Ser 


He 


Gin 


Ser 


His 


Thr 


Glu 


He 


Val 


Pro 


Thr 


Asp 








260 










265 










270 






Ala 


Tyr 


Gly 


Asn 


He 


Val 


Phe 


Glu 


Gly 


Thr 


Ala 


His 


His 


Ala 


Gin 


Tyr 






275 










280 










285 








Ala 


Arg 


lie 


Ser 


Phe 


Asp 


Ser 


Asp 


Pro Arg Asp 


He 


Val 


His 


Leu 


Met 




290 










295 










300 










Met 


Lys 


Val 


Trp 


Lys 


Leu 


Lys 


Pro 


Pro 


Lys 


Leu 


He 


He 


Thr 


He 


Asn 


305 










310 










315 










320 


Gly 


Gly 


Leu 


Thr 


Lys 


Phe 


Asp 


Leu 


Gin 


Pro 


Lys 


Leu 


Ala 


Arg 


Thr 


Phe 
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325 



Airg 


Lys 


\3±y 


T 1 


Met 


Lys 














1 nxr 


Ser 


fz^ \i 

O-L y 


Leu 


Asp 


Glu 














His 


7\ 1 -a 

J\±3i 


Leu 










J /u 










Asp 


vax 


Leu 


Leu 


Ala 


His 


385 












Asp 


His 


(jiy 


Asn 


11 J. S 


1 nxr 














Ala 


Ser 


Trp 


Gi y 


Met 


Leu 








>i o n 






Ser 


Thr 


vax 


i nr 


Tyr 








A C 

4 








Glu 


Leu 


Asn 


Asp 


Asn 


His 




4 50 










Val 


Asn 


Arg 


Tyr 




AX a 


4 65 










4 /u 


Tyr 


Leu 


Ala 


Gin 


Gly 


*Asp 










/toe 




Val 


Val 


Leu 


Glu 


vjXy 


Gly 








c r» n 
500 






Val 


Thr 


1 nr 


X ±e 


Pro 


Arg 






olo 








Arg 


Ala 


Ala 


Asp 


±xe 


Leu 




530 










Gly 


Phe 


Leu 


C A v> 

ber 


Asp 


Asn 












33 u 


Axg 


lie 


fne 


oJiy 


Tyr 


Asp 














lie 


Val 


Glu 


Cys 


C A V 

iber 


1 nr 














oAy 




Car* 

oer 


oer 


Arg 


w J- u 






R Q Q 








Leu 


Lys 


t»iy 


oxn 


Asn 


Leu 




610 










Ala 


Trp 


Asn 


Arg 


A±a 


Asp 


£ o c 










D 


Thr 


Glu 


Trp 


1 nr 


1 nr 


isxn 














Ser 


Asn 


Asp 


Arg 


lie 


Asp 








boU 






Ser* 


nen 


u±n 


Lys 




Leu 






D / o 








Thr 


Asp 


Lys 


V3±y 


Pro 


oxn 




d90 










Ser 


Lys 


His 


His 


lie 


Lys 


705 










710 


Leu 


Met 


Gly 


Asn 


Leu 


Tyr 










725 




Asn 


Gin 


Tyr 


Phe 


Leu 


Phe 








740 






His 


Ser 


Asn 


Ser 


Asn 


Gly 






755 








Asp 


Ala 


Gly 


Arg 


Glu 


Arg 




770 










Asn 


Ala 


Arg 


Asn 


Ser 


He 


785 










790 


Arg 


Glu 


Ser 


Asp Asp 


Glu 



-19- 

330 



He 


Ala 


Lvs 


Ser 


Thr 


Asp 


Ala 
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Tyr 


Thr 


Lys 


Lys 


Val 






835 










840 










845 
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Thr lie lie Ser Ser Arg Lys Asn Ser Gly Val Ala Ser Val Tyr Gly 

850 855 860 

Ser Ala Ser Ser Met Met Phe Lys Arg Glu Pro Gin Leu Asn Lys Phe 
865 870 875 880 

Giu Arg Phe Arg Ala Phe Tyr Ser Ser Pro lie Thr Lys Phe Trp Ser 

885 890 895 

Trp Cys He Ala Phe Leu He Phe Leu Thr Thr Gin Thr Cys He Leu 

900 905 910 

Leu Leu Glu Thr Ser Leu Lys Pro Ser Lys Tyr Glu Trp He Thr Phe 

915 920 925 

He Tyr Thr Val Thr Leu Ser Val Glu His He Arg Lys Leu Met Thr 

930 935 940 

Ser Glu Gly Ser Arg He Asn Glu Lys Val Lys Val Phe Tyr Ala Lys 
945 950 955 960 

Trp Tyr Asn He Trp Thr Ser Ala Ala Leu Leu Phe Phe Leu Val Gly 

965 970 975 

Tyr Gly Phe Arg Leu Val Pro Met Tyr Arg His Ser Trp Gly Arg Val 

980 985 990 

Leu Leu Ser Phe Ser Asn Val Leu Phe Tyr Met Lys He Phe Glu Tyr 

995 1000 1005 

Leu Ser Val His Pro Leu Leu Gly Pro Tyr He Gin Met Ala Ala Lys 

1010 1015 1020 

Met Val Trp Ser Met Cys Tyr He Cys Val Leu Leu Leu Val Pro Leu 
1025 1030 1035 104 

Met Ala Phe Gly Val Asn Arg Gin Ala Leu Thr Glu Pro Asn Val Lys 

1045 1050 1055 

Asp Trp His Trp Leu Leu Val Arg Asn He Phe Tyr Lys Pro Tyr Phe 

1060 1065 1070 

Met Leu Tyr Gly Glu Val Tyr Ala Gly Glu He Asp Thr Cys Gly Asp 

1075 1080 1085 

Glu Gly He Arg Cys Phe Pro Gly Tyr Phe He Pro Pro Leu Leu Met 

1090 1095 1100 

Val He Phe Leu Leu Val Ala Asn He Leu Leu Leu Asn Leu Leu He 
1105 1110 1115 112 

Ala He Phe Asn Asn He Tyr Asn Asp Ser lie Glu Lys Ser Lys Glu 

1125 1130 1135 

He Trp Leu Phe Gin Arg Tyr Gin Gin Leu Met Glu Tyr His Asp Ser 

1140 1145 1150 

Pro Phe Leu Pro Pro Pro Phe Ser He Phe Ala His Val Tyr His Phe 

1155 1160 1165 

He Asp Tyr Leu Tyr Asn Leu Arg Arg Pro Asp Thr Lys Arg Phe Arg 

1170 1175 1180 

Ser Glu His Ser He Lys Leu Ser Val Thr Glu Asp Glu Met Lys Arg 
1185 1190 1195 120 

He Gin Asp Phe Glu Glu Asp Cys He Asp Thr Leu Thr Arg He Arg 

1205 1210 1215 

Lys Leu Lys Leu Asn Thr Lys Glu Pro Leu Ser Val Thr Asp Leu Thr 

1220 1225 1230 

Glu Leu Thr Cys Gin Arg Val His Asp Leu Met Gin Glu Asn Phe Leu 

1235 1240 1245 

Leu Lys Ser Arg Val Tyr Asp He Glu Thr Lys He Asp His He Ser 

1250 1255 1260 

Asn Ser Ser Asp Glu Val Val Gin He Leu Lys Asn Lys Lys Leu Ser 
1265 1270 1275 128 

Gin Asn Phe Ala Ala Ser Ser Leu Ser Leu Pro Asp Thr Ser He Glu 

1285 1290 1295 

Val Pro Lys He Thr Lys Thr Leu He Asp Cys His Leu Ser Pro Val 

1300 1305 1310 

Ser He Glu Asp Arg Leu Ala Thr Arg Ser Pro Leu Leu Ala Asn Leu 
1315 1320 1325 
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Gln Arg Asp Kis Thr Leu Arg Lys Leu Pro Thr Trp Glu Thr Ser Thr 

1330 1335 1340 

Ala Ser Thr Ser Ser Phe Glu Phe Val Phe Tyr Phe Thr Arg His Glu 
1345 1350 1355 136 

Gly Asn Glu Asn Lys Tyr Glu Phe Lys Lys Leu Glu Lys Gly Gly Phe 

1365 1370 1375 

Trp Arg Asn Asn Tyr Val lie Ser Trp Arg Leu 
1380 1385 



<210> 15 
<211> 1868 
<212> PRT 
<213> C. Elegans 





<400> 


I O 










Met 


Asn 


Leu 


Cys 
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Arg 
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Trp 
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1 nr 


Met 
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Ser 


Asp 
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Ser 


Lys 


Gly 
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Gin 


Val 


Pro 






o c 










H U 


Ala 


Gly 


Gly 


Asp 


Gly 


Asn 


Ala 


vax 




50 










55 




Gin 


rp Wi w- 

Tnr 


rne 


Asn 


Ser 


Gly 


Arg 




DO 










70 






Arg 


Leu 


Asn 


Glu 


Asp 


Val 


Ser 


/\± a. 


















Leu 


Pro 


inr 


Pro 


Lieu 


Phe 


Asn 










xuu 










Ser 


Leu 


Asn 


Arg 


nlS 


Asn 


Trp 


Val 
















120 


Cys 


Ser 


Arg 


fne 


xxe 


Ala 


Ser 


Ser 




1 "in 










135 




G±y 


Arg 


Thr 


Arg 


Asp 


Ala 


His 


Arg 


145 










150 






Phe 


Leu 


Arg 


Gin 


Lys 


Arg 


Ser 


Val 










165 








lie 


Ser 


Asn 


Val 


Asn 


Asp Asp 


He 








180 










Arg 


Gly 


Ala 


Asn 


Glu 


Lys 


Trp 


Ser 






195 










200 


Ala 


Thr 


Asn 


Ala 


Phe 


Gly 


Gin 


Val 




210 










215 




Tyr 


Lys 


Ala 


Gin 


Tyr 


Val 


Arg 


Val 


225 










230 






He 


Met 


Ser 


Leu 


Phe 


Glu 


His 


Val 










245 








He 


He 


Thr 


Val 


His 


Gly 


Gly 


Thr 








260 










Leu 


Ala 


Arg 


Val 


Phe 


Arg 


Lys 


Gly 






275 










280 


Gly 


Ala 


Trp 


He 


He 


Thr 


Ser 


Gly 




290 










295 




Val 


Ala 


Ala 


Ala 


Leu 


Glu 


Gly Ala 


305 










310 






Val 


Cys 


He 


Gly 


He 


Ala 


Pro 


Trp 










325 








Phe 


He 


Gly 


Gin 


Asp 


Lys 


Thr 


Val 








340 










Gly 


Arg 


Phe 


Thr 


Gly 


Leu 


Asn 


Asn 



Arg 


T* \r r" 

1 yr 


Ala 


Ser 


Ser 


Pro 


Glu 


Val 




1 0 










15 




Leu 


O-L y 


Val 


Thr 


Arg 




Leu 


Gin 












30 






Pro 


Th-r 


Ser 


Thr 


Thr 


Thr 


Gly 


Glv 

o j.y 










45 








Pro 


Thr 


Thr 


Ser 


Gin 


Ala 


Gin 


Ala 








60 










1 IIX 


Thr 


Gly 


Met 


Ser 


Ser 


Gl V 


Asp 






7 S 










80 
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Ala 


Asn 


Ser 


Ala 
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Leu 


Val 
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Met 


Arg 


Phe 
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Glu 


Ser 
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Met 
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Arg 


Glu 
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Ara 


Arg 


Glu 
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Arg 


Asp 


Leu 
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Lvs 


Cys 


Gly 


Cys 
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Ser 


Glu 
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160 


Ala 


Ala 


Leu 


Glu 


Gin 
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Arg 


Ser 
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Gin 
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Met 
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Arg 


Lys 
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Ser 


Leu 
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Gin 


Gly 


Gly 


Pro 


His 
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Glu 


Pro 


Ala 
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Trp 
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Ser 
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Pro 


Arg 


Leu 
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Pro 


Lys 
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Ser 


Thr 


Thr 
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Asp 
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Gly 


Val 
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His 
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Ser 


Ala 
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Arg 
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Lys 


He 
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Gly 


Leu 
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Lys 


Lys 


Arg 


Glu 


Asp 
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Pro 


Tyr 


Tyr 


Pro 


Ser 


Ser 


Ser 


Lys 


345 










350 






Arg 


His 


Ser 


Tyr 


Phe 


Leu 


Leu 


Val 
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He 
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Arg 
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Met 
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His 
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Met 
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Arg 
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835 84C 845 

Ala Glu Glu Phe Arg lie Leu Ser Leu Glu Leu Leu Asp His Cys Tyr 

850 855 860 

His Val Asp Asp Ala Gin Thr Leu Gin Leu Leu Thr Tyr Glu Leu Ser 
865 870 875 880 

Asn Trp Ser Asn Glu Thr Cys Leu Ala Leu Ala Val lie Val Asn Asn 

885 890 895 

Lys His Phe Leu Ala His Pro Cys Cys Gin lie Leu Leu Ala Asp Leu 

900 905 910 

Trp His Gly Gly Leu Arg Met Arg Thr His Ser Asn He Lys Val Val 

915 920 925 

Leu Gly Leu He Cys Pro Pro Phe He Gin Met Leu Glu Phe Lys Thr 

930 935 940 

Arg Glu Glu Leu Leu Asn Gin Pro Gin Thr Ala Ala Glu His Gin Asn 
945 950 955 960 

Asp Met Asn Tyr Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser 

965 970 975 

Ser Ser Ser Ser Ser Asp Ser Ser Ser Phe Glu Asp Asp Asp Asp Glu 

980 985 990 

Asn Asn Ala His Asn His Asp Gin Lys Arg Thr Arg Lys Thr Ser Gin 

995 1000 1005 

Gly Ser Ala Gin Ser Leu Asn lie Thr Ser Leu Phe His Ser Arg Arg 

1010 1015 1020 

Arg Lys Ala Lys Lys Asn Glu Lys Cys Asp Arg Glu Thr Asp Ala Ser 
1025 1030 1035 104 

Ala Cys Glu Ala Gly Asn Arg Gin He Gin Asn Gly Gly Leu Thr Ala 

1045 1050 1055 

Glu Tyr Gly Thr Phe Gly Glu Ser Asn Gly Val Ser Pro Pro Pro Pro 

1060 1065 1070 

Tyr Met Arg Ala Asn Ser Arg Ser Arg Tyr Asn Asn Arg Ser Asp Met 

1075 1080 1085 

Ser Lys Thr Ser Ser Val He Phe Gly Ser Asp Pro Asn Leu Ser Lys 

1090 1095 1100 

Leu Gin Lys Ser Asn He Thr Ser Thr Asp Arg Pro Asn Pro Met Glu 
1105 1110 1115 112 

Gin Phe Gin Gly Thr Arg Lys He Lys Met Arg Arg Arg Phe Tyr Glu 

1125 1130 1135 

Phe Tyr Ser Ala Pro He Ser Thr Phe Trp Ser Trp Thr He Ser Phe 

1140 1145 1150 

He Leu Phe He Thr Phe Phe Thr Tyr Thr Leu Leu Val Lys Thr Pro 

1155 1160 1165 

Pro Arg Pro Thr Val He Glu Tyr He Leu He Ala Tyr Val Ala Ala 

1170 1175 1180 

Phe Gly Leu Glu Gin Val Arg Lys He He Met Ser Asp Ala Lys Pro 
1185 1190 1195 120 

Phe Tyr Glu Lys He Arg Thr Tyr Val Cys Ser Phe Trp Asn Cys Val 

1205 1210 1215 

Thr He Leu Ala He He Phe Tyr He Val Gly Phe Phe Met Arg Cys 

1220 1225 1230 

Phe Gly Ser Val Ala Tyr Gly Arg Val He Leu Ala Cys Asp Ser Val 

1235 1240 1245 

Leu Trp Thr Met Lys Leu Leu Asp Tyr Met Ser Val His Pro Lys Leu 

1250 1255 1260 

Gly Pro Tyr Val Thr Met Ala Gly Lys Met He Gin Asn Met Ser Tyr 
1265 1270 1275 128 

He He Val Met Leu Val Val Thr Leu Leu Ser Phe Gly Leu Ala Arg 

1285 1290 1295 

Gin Ser He Thr Tyr Pro Asp Glu Thr Trp His Trp He Leu Val Arg 

1300 1305 1310 

Asn He Phe Leu Lys Pro Tyr Phe Met Leu Tyr Gly Glu Val Tyr Ala 
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1315 1320 1325 

Asp Glu He Asp Thr Cys Gly Asp Glu Ala Trp Asp Gin His Leu Glu 

1330 1335 1340 

Asn Gly Gly Pro Val He Leu Gly Asn Gly Thr Thr Gly Leu Ser Cys 
1345 1350 1355 136 

Val Pro Gly Tyr Trp He Pro Pro Leu Leu Met Thr Phe Phe Leu Leu 

1365 1370 1375 

He Ala Asn He Leu Leu Met Ser Met Leu He Ala He Phe Asn His 

1380 1385 1390 

He Phe Asp Ala Thr Asp Glu Met Ser Gin Gin He Trp Leu Phe Gin 

1395 1400 1405 

Arg Tyr Lys Gin Val Met Glu Tyr Glu Ser Thr Pro Phe Leu Pro Pro 

1410 1415 1420 

Pro Leu Thr Pro Leu Tyr His Gly Val Leu He Leu Gin Phe Val Arg 
1425 1430 1435 144 

Thr Arg Leu Ser Cys Ser Lys Ser Gin Glu Arg Asn Pro He Leu Leu 

1445 1450 1455 

Leu Lys He Ala Glu Leu Phe Leu Asp Asn Asp Gin He Glu Lys Leu 

1460 1465 1470 

His Asp Phe Glu Glu Asp Cys Met Glu Asp Leu Ala Arg Gin Lys Leu 

1475 1480 1485 

Asn Glu Lys Asn Thr Ser Asn Glu Gin Arg He Leu Arg Ala Asp He 

1490 1495 1500 

Arg Thr Asp Gin He Leu Asn Arg Leu He Asp Leu Gin Ala Lys Glu 
1505 1510 1515 152 

Ser Met Gly Arg Asp Val He Asn Asp Val Glu Ser Arg Leu Ala Ser 

1525 1530 1535 

Val Glu Lys Ala Gin Asn Glu He Leu Glu Cys Val Arg Ala Leu Leu 

1540 1545 1550 

Asn Gin Asn Asn Ala Pro Thr Ala He Gly Arg Cys Phe Ser Pro Ser 

1555 1560 1565 

Pro Asp Pro Leu Val Glu Thr Ala Asn Gly Thr Pro Gly Pro Leu Leu 

1570 1575 1580 

Leu Lys Leu Pro Gly Thr Asp Pro He Leu Glu Glu Lys Asp His Asp 
1585 1590 1595 160 

Ser Gly Glu Asn Ser Asn Ser Leu Pro Pro Gly Arg He Arg Arg Asn 

1605 1610 1615 

Arg Thr Ala Thr He Cys Gly Gly Tyr Val Ser Glu Glu Arg Asn Met 

1620 1625 1630 

Met Leu Leu Ser Pro Lys Pro Ser Asp Val Ser Gly He Pro Gin Gin 

1635 1640 1645 

Arg Leu Met Ser Val Thr Ser Met Asp Pro Leu Pro Leu Pro Leu Ala 

1650 1655 1660 

Lys Leu Ser Thr Met Ser He Arg Arg Arg His Glu Glu Tyr Thr Ser 
1665 1670 1675 168 

He Thr Asp Ser He Ala He Arg His Pro Glu Arg Arg He Arg Asn 

1685 1690 1695 

Asn Arg Ser Asn Ser Ser Glu His Asp Glu Ser Ala Val Asp Ser Glu 

1700 1705 1710 

Gly Gly Gly Asn Val Thr Ser Ser Pro Arg Lys Arg Ser Thr Arg Asp 

1715 1720 1725 

Leu Arg Met Thr Pro Ser Ser Gin Val Glu Glu Ser Thr Ser Arg Asp 

1730 1735 1740 

Gin He Phe Glu He Asp His Pro Glu His Glu Glu Asp Glu Ala Gin 
1745 1750 1755 176 

Ala Asp Cys Glu Leu Thr Asp Val He Thr Glu Glu Glu Asp Glu Glu 

1765 1770 1775 

Glu Asp Asp Glu Glu Asp Asp Ser His Glu Arg His His He His Pro 

1780 1785 1790 

Arg Arg Lys Ser Ser Arg Gin Asn Arg Gin Pro Ser His Thr Leu Glu 
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1795 1800 1805 

Thr Asp Leu Ser Glu Gly Glu Giu Val Asp Pro Leu Asp Val Leu Lys 

1810 1815 1820 

Met Lys Glu Leu Pro lie lie His Gin lie Leu Asn Glu Glu Glu Gin 
1825 1830 1835 184 

Ala Gly Ala Pro His Ser Thr Pro Val lie Ala Ser Pro Ser Ser Ser 

1845 1850 1855 

Arg Ala Asp Leu Thr Ser Gin Lys Cys Ser Asp Val 
1860 1865 

<210> 16 
<211> 489 
<212> DNA 

<213> Mus Musculus 



<400 
ccctgaaaga 
ccggggctgt 
ggctgcaggc 
agaatgtccc 
ataccaagct 
gtcagatgtt 
atgaaatact 
gtggaaaagc 
ggttctcat 



> 16 
ctcgacttct 
ccttgttcgg 
cgcggaggtg 
agaaatcctg 
ccaaagaccc 
tctgtggtcg 
cagatgtgaa 
acacggagca 



gctgctagcg 
tggcccgtgc 
gaggaggagc 
gatagagagc 
tcacagatgt 
tttggtcaag 
attgggtgaa 
gagcccaaca 



ctggagctga 
caccgcctcc 
cgctgccctt 
actttgacca 
cttccaggat 
caacatgcat 
cactttaacc 
gatgcttatg 



gttagttttg 


agaaggtttc 


60 


ggagacgctt 


tccgatagat 


120 


ccggagtccg 


ccccgtgagg 


180 


agagggagtg 


tgtatatatt 


240 


gtcagatttg 


tcagcaactt 


300 


gctttactgc 


aagtcttgcc 


360 


aggcaataga 


agaatggtct 


420 


gagtcatcaa 


ttttcaaggg 


480 






489 



<210> 17 
<211> 102 
<212> PRT 

<213> Mus Musculus 
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17 


Met 


Ser 


Gin 


Lys 


1 








Val 


Tyr 


lie 


lie 








20 


Cys 


Gin 


lie 


Cys 






35 




Lys 


Gin 


His 


7U.a 




50 






Val 


Lys 


Leu 


Gly 


65 








Glu 


Lys 


His 


Thr 


Phe 


Gin 


Gly 


Gly 



100 



Ser Trp lie Glu 
5 

Pro Ser Ser Lys 

Gin Gin Leu Val 
40 

Cys Phe Thr Ala 
55 

Glu His Phe Asn 
70 

Glu Gin Ser Pro 
85 

Ser His 



Ser Thr Leu Thr 
10 

Asp Pro His Arg 

25 

Arg Cys Phe Cys 

Ser Leu Ala Met 
60 

Gin Ala lie Glu 

75 

Thr Asp Ala Tyr 
90 



Lys Arg Glu Cys 
15 

Cys Leu Pro Gly 
30 

Gly Arg Leu Val 
45 

Lys Tyr Ser Asp 

Glu Trp Ser Val 
80 

Gly Val lie Asn 
95 



<210> 18 
<211> 410 
<212> DNA 

<213> Homo Sapiens 

<220> 

<221> unsure 
<222> (6)... (6) 



<221> unsure 
<222> (58) . . . (58) 



<221> unsure 
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<222> (89) , . . (89) 

<221> unsure 

<222> (406) . . . (406) 



<400> 18 

gccgcnggag cctgagcgga gggtgtgcgc agcctcgcca gcgggggccc cgggctgngc 60 

cattgcctca ctgagccagc gcctgcctnc tacctcgccg acagctggaa ccagtgcgac 120 

ctagtggctc tcacctgctt cctcctgggc gtgggctgcc ggctgacccc gggtttgtac 180 

cacctgggcc gcactgtcct ctgcatcgac ttcatggttt tcacggtgcg gctgcttcac 240 

atcttcacgg tcaacaaaca gctggggccc aagatcgtca tcgtgagcaa gatgatgaag 300 

gacgtgttct tcttcctctt cttcctcggc gtgtggctgg tagctatggg ttgggccacg 360 

gaggggttcc tgaggccacg ggacagtgac ttcccaagta tcctgncgcc 410 



<210> 19 

<211> 131 

<212> PRT 

<213> Homo Sapiens 

<220> 

<221> UNSURE 

<222> (15),. . . (15) 

<223> UNKNOWN 

<221> UNSURE 

<222> (25) . . . (25) 

<223> UNKNOWN 

<221> UNSURE 

<222> (131) . . . (131) 

<223> UNKNOWN 

<400> 19 



Ala 


Glu 


Gly 


Val 


Arg 


Ser 


Leu 


Ala 


Ser 


Gly 


Gly 


Pro 


Gly 


Leu 


Xaa 


His 


1 








5 










10 










15 




Cys 


Leu 


Thr 


Glu 
20 


Pro 


Ala 


Pro 


Ala 


Xaa 
25 


Tyr 


Leu 


Ala 


Asp 


Ser 
30 


Trp 


Asn 


Gin 


Cys 


Asp 
35 


Leu 


Val 


Ala 


Leu 


Thr 
40 


Cys 


Phe 


Leu 


Leu 


Gly 
45 


Val 


Gly 


Cys 


Arg 


Leu 
50 


Thr 


Pro 


Gly 


Leu 


Tyr 
55 


His 


Leu 


Gly 


Arg 


Thr 
60 


Val 


Leu 


Cys 


He 


Asp 


Phe 


Met 


Val 


Phe 


Thr 


Val 


Arg 


Leu 


Leu 


His 


He 


Phe 


Thr 


Val 


Asn 


65 










70 










75 










80 


Lys 


Gin 


Leu 


Gly 


Pro 
85 


Lys 


lie 


Val 


lie 


Val 
90 


Ser 


Lys 


Met 


Met 


Lys 
95 


Asp 


Val 


Phe 


Phe 


Phe 
100 


Leu 


Phe 


Phe 


Leu 


Gly 
105 


Val 


Trp 


Leu 


Val 


Ala 
110 


Met 


Gly 


Trp 


Ala 


Thr 


Glu 


Gly 


Phe 


Leu 


Arg 


Pro 


Arg 


Asp 


Ser 


Asp 


Phe 


Pro 


Ser 



115 120 125 



He Leu Xaa 
130 

<210> 20 

<211> 389 

<212> DNA 

<213> Homo Sapiens 

<400> 20 

caaatttttt gttagtacac catctcatcc aaattgcaaa agtcacatgg aaactggaac 60 
caaagatcaa gaaactgttt gctctaaagc tacagaagga gataatacag aatttggagc 120 
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atttgtagga cacagagata gcatggatzt acagaggttt aaagaaacat caaacaagat 180 

aaaaatacta tccaataaca atacttctga aaacactttg aaacgagtga gttctcttgc 24 0 

tggatttact gactgtcaca gaacttccat tcctgttcat tcaaaacgag aaaagatcag 300 

tagaaggcca tctaccgaag acactcatga agtagattcc aaagcagctt taataccggt 360 

ttgtagattt caactaaaca gatatatat 389 

<210> 21 
<211> 415 
<212> DNA 
<213> Homo Sapiens 

<400> 21 

atttctagtt tttcaaattt gccagtcttt 
ttatatttaa aactttttta tgtccatcat 
aaccaataat tccactatct tatcagaaat 
gagttctaaa ttgccattgt gaggtcatct 
tgcagctcag ggtcaggaag agtccctcca 
tgttaactaa cctctttccc cactgaaata 
taatctctct atgccagaac agatatatat 



ttgaatagta tctccttctt 
cactttaaac atacttattt 
caaataccgt ttatgtaagt 
tcggttaggc tttaatttgt 
gaaaggagga tttgttactg 
acttttttca ataacatgat 
gaatgtaagt caatattttc 



ttctcatgtt 60 

tgtcatctat 120 

tgactcccat 180 

tgcaaagttg 240 

tgaatctctt 300 

tttaacaaca 360 

ttgag 415 



<210> 22 
<211> 405 
<212> DNA 
<213> Mus 



Musculus 



<400> 22 

ttattatggc ttatcatgaa aaaccagtcc tgcctcctcc tcttatcatc ctcagccata 60 

tagtttcact gttttgctgt gtatgcaaaa gaagaaagaa agataagact tccgatgggc 120 

caaaactttt cttaacagaa gaagatcaaa agaaactcca tgattttgaa gagcagtgtg 180 

ttgagatgta ctttgatgag aaagatgaca aattcaattc tgggagtgaa gagagaatcc 24 0 

gggtcacttt tgaaagagtg gagcagatga gcattcagat taaagaagtt ggagatcgtg 300 

tcaactacat aaaaagatca ttacagtctt tagattctca aattggtcat ctgcaagatc 360 

tctcagccct aacagtagat acattgaaaa cacttacagc ccaga 4 05 



<210> 23 

<211> 5117 

<212> DNA 

<213> Homo Sapiens 

<220> 

<221> unsure 

<222> (2382) . . . (2382) 

<223> unknown 



<221> unsure 

<222> (4664) . . . (4664) 

<223> un)cnown 



<221> unsure 

<222> (4682) . . . (4682) 

<223> unknown 



<221> unsure 

<222> (4702) , . . (4702) 

<223> unknown 



<221> 
<222> 
<223> 



unsure 

(5038) ... (5039) 
unknown 
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<221> unsure 

<222> (5056) . . . (5056) 

<223> unknown 

<221> unsure 

<222> (5071) . . . (5072) 



<400> 23 

gatggcaaca tggtgaagaa tcaatggcta aagcattagt tgcctgtaag atctatcgtt 60 

caatggcata tgaagcaaag cagagtgacc tggtagatga tacttcagaa gaactaaaac 120 

agtattccaa tgattttggt cagttggccg ttgaattatt agaacagtcc ttcagacaag 180 

atgaaaccat ggctatgaaa ttgctcactt atgaactgaa gaactggagt aattcaacct 24 0 

'gccttaagtt agcagtttct tcaagactta gaccttttgt agctcacacc tgtacacaaa 300 

tgttgttatc tgatatgtgg atgggaaggc tgaatatgag gaaaaattcc tggtacaagg 360 

tcatactaag cattttagtt ccacctgcca tattgctgtt agagtataaa actaaggctg 420 

aaatgtccca tatcccacaa tctcaagatg ctcatcagat gacaatggat gacagcgaaa 4 80 

acaactttca gaacataaca gaagagatcc ccatggaagt gtttaaagaa gtacggattt 54 0 

tggatagtaa tgaaggaaag aatgagatgg agatacaaat gaaatcaaaa aagcttccaa 600 

ttacgcgaaa gttttatgcc ttttatcatg caccaattgt aaaattctgg tttaacacgt 660 

tggcatattt aggatttctg atgctttata catttgtggt tcttgtacaa atggaacagt 720 

taccttcagt tcaagaatgg attgttattg cttatatttt tacttatgcc attgagaaag 780 

tccgtgagat ctttatgtct gaagctggga aagtaaacca gaagattaaa gtatggttta 84 0 

gtgattactt caacatcagt gatacaattg ccataatttc tttcttcatt ggatttggac 900 

taagatttgg agcaaaatgg aactttgcaa atgcatatga taatcatgtt tttgtggctg 960 

gaagattaat ttactgtctt aacataatat tttggtatgt gcgtttgcta gattttctag 1020 

ctgtaaatca acaggcagga ccttatgtaa tgatgattgg aaaaatggtg gccaatatgt 1080 

tctacattgt agtgattatg gctcttgtat tacttagttt tggtgttccc agaaaggcaa 114 0 

tactttatcc tcatgaagca ccatcttgga ctcttgctaa agatatagtt tttcacccat 1200 

actggatgat ttttggtgaa gtttatgcat acgaaattga tgtgtgtgca aatgattctg 1260 

ttatccctca aatctgtggt cctgggacgt ggttgactcc atttcttcaa gcagtctacc 1320 

tctttgtaca gtatatcatt atggttaatc ttcttattgc atttttcaac aatgtgtatt 1380 

tacaagtgaa ggcaatttcc aatattgtat ggaagtacca gcgttatcat tttattatgg 1440 

cttatcatga gaaaccagtt ctgcctcctc cacttatcat tcttagccat atagtttctc 1500 

tgttttgctg catatgtaag agaagaaaga aagataagac ttccgatgga ccaaaacttt 1560 

tcttaacaga agaagatcaa aagaaacttc atgattttga agagcagtgt gttgaaatgt 1620 

atttcaatga aaaagatgac aaatttcatt ctgggagtga agagagaatt cgtgtcactt 1680 

ttgaaagagt ggaacagatg tgcattcaga ttaaagaagt tggagatcgt gtcaactaca 1740 

taaaaagatc attacaatca ttagattctc aaattggcca tttgcaagat ctttcagccc 1800 

tgacggtaga tacattaaaa acactcactg cccagaaagc gtcggaagct agcaaagttc 1860 

ataatgaaat cacacgagaa ctgagcattt ccaaacactt ggctcaaaac cttattgatg 1920 

atggtcctgt aagaccttct gtatggaaaa agcatggtgt tgtaaataca cttagctcct 1980 

ctcttcctca aggtgatctt gaaagtaata atccttttca ttgtaatatt ttaatgaaag 2040 

atgacaaaga tccccagtgt aatatatttg gtcaagactt acctgcagta ccccagagaa 2100 

aagaatttaa ttttccagag gctggttcct cttctggtgc cttattccca agtgctgttt 2160 

cccctccaga actgcgacag agactacatg gggtagaact cttaaaaata tttaataaaa 2220 

atcaaaaatt aggcagttca tctactagca taccacatct gtcatcccca ccaaccaaat 2280 

tttttgttag tacaccatct cagccaagtt gcaaaagcca cttggaaact ggaaccaaag 234 0 

atcaagaaac tgtttgctct aaagctacag aaggagataa tncagaattt ggagcatttg 24 00 

taggacacag agatagcatg gatttacaga ggtttaaaga aacatcaaac aagataaaaa 2 4 60 

tactatccaa taacaatact tctgaaaaca ctttgaaacg agtgagttct cttgctggat 2520 

ttactgactg tcacagaact tccattcctg ttcattcaaa acaagcagaa aaaatcagta 2580 

gaaggccatc taccgaagac actcatgaag tagattccaa agcagcttta ataccggatt 2640 

ggttacaaga tagaccatca aacagagaaa tgccatctga agaaggaaca ttaaatggtc 2700 

tcacttctcc atttaagcca gctatggata caaattacta ttattcagct gtggaaagaa 2760 

ataacttgat gaggttatca cagagcattc catttacacc tgtgcctcca agaggggagc 2820 

ctgtcacagt gtatcgtttg gaagagagtt cacccaacat actaaataac agcatgtctt 2880 

cttggtcaca actaggcctc tgtgccaaaa tagagttttt aagcaaagag gagatgggag 2940 

gaggtttacg aagagctgtc aaagtacagt gtacctggtc agaacatgat atcctcaaat 3000 

cagggcatct ttatattatc aaatcttttc ttccagaggt ggttaataca tggtcaagta 3060 

tttataaaga agatacagtt ctgcatctct gtctgagaga aattcaacaa cagagagcag 3120 



wo 00/40614 



PCTAJS99y29996 



cacaaaagct 
ggttccttga 
aatgtatgac 
ctaatactct 
gggagttact 
taaaagcaga 
atgcaattaa 
ttccagatct 
cttcagattt 
ttcgtctgat 
aatgttactg 
gcaagttgaa 
atcagtcaga 
taattagtat 
tgttttgatg 
gtccatgcag 
cacgcagcct 
tgatgatatc 
tctaattcta 
cggtatttat 
atagaaccca 
actgttaagt 
acctaactca 
atctgtatct 
tatatcaaat 
ttgaacatga 
anactggcag 
ttgttgaata 
ttttcttttt 
gggagcagta 
gagatgataa 
tgtattactt 
ataaatttat 
gatccctcga 



tacgtttgcc 
agttttcctg 
tggagaattt 
ggaagagatc 
ggtacttgat 
agaaaagaga 
aaacttcaga 
gaagaggaat 
gaatcttcag 
gttataatat 
tgtcactttt 
tttgctgact 
aagtccctat 
ttgtaacttt 
agggtgatgc 
attttattgg 
tttggttaaa 
actggttccc 
ggttcacaga 
gaatcagggt 
aattgtgtaa 
agaagaaagc 
ctcgtcaagg 
atagcaacaa 
cagaatgtaa 
atgctgtttg 
attgcagtgc 
ttcaacaata 
ttttcttttt 
taaagcttat 
gctgtttaaa 
aaagatacta 
tgtttncaaa 
ggggccc 



tttaatcaaa 
ctgtattgcc 
agaaaataca 
atgctagcct 
ttgcaaggtg 
tcctgtgata 
gcaaaacatc 
gattatacgc 
cctggaaatt 
taatattact 
ccctcgggag 
ccagcacagt 
aggatagagc 
ttaaagggct 
cctctcttat 
cagctgtttt 
tggcagtcaa 
aactagatgc 
ggtatgagaa 
taggctccat 
aaatgttaat 
catggtaaac 
gaagttacct 
cccagaactt 
acatgaactc 
ggtacttaag 
anccttatgc 
acaagaaaag 
cttttgtttc 
ttgtatttag 
tagtgtttaa 
tatacgtttt 
aaaaaaaccc 
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tgaaacccaa 
attcagcagg 
acaataataa 
ttagccactg 
ttggtgaaaa 
tggtttttgg 
actgtaattc 
ctgataaaat 
ccaccaaaga 
gaatcattgg 
gaaattgttt 
taaaaggtca 
tggcagctga 
ctttgtatag 
gtggtacaat 
attgccattc 
aattttcctc 
ttgttggcca 
gcctgaactg 
atttaaagat 
tgggtttttt 
tgatacataa 
tttgaggaaa 
acaaacttct 
ttgcatatat 
aaattrattc 
caataaaatg 
cttttcatct 
cttggtacta 
tagtgtatct 
tattgattgg 
gtatatcatt 
nntaaaaaaa 



atccatacca 
acagtggttt 
tggagatgag 
gacttacgaa 
tttgactgac 
cccagcaaat 
ttgctgtaga 
tatatttcct 
atcagaatca 
ttttgcctgc 
ggtaatatag 
atattctttt 
gaaattttaa 
cagaggatct 
accattaacc 
aactagggaa 
agtgtattta 
cgggaaggga 
aagaccattt 
agagccagtt 
aaacattgtt 
cctaaattat 
gttaaagtac 
ccaaagattt 
ttaaaattgt 
agtnggatta 
taatttaaca 
aagttttatg 
attttaattt 
catagataca 
gggtggggag 
aaatctttaa 
aaagggcggc 



tattctccaa 
gctgrggaag 
attattccaa 
tatacaagag 
ccatctgtga 
ctaggagaag 
aagcttaaac 
caggatgagc 
gctaattctg 
acctcacaga 
aaaggtgtat 
gacctgatta 
aggtaattga 
catttgactt 
aaaggtaggt 
atgaagaaat 
gtgtgttcag 
aatgacttgt 
tcaagaggga 
ttttttttaa 
ttatcaagtc 
aaaagcagaa 
ttttttccct 
tattgattgt 
gttggaacat 
tcattatgtg 
gccccagata 
ctttaatttt 
ttatttggaa 
gacaaggcaa 
aaagaaaaag 
aagaaatnna 
ccctctagag 



3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5117 



<210> 24 
<211> 1224 
<212> PRT 

<213> Homo Sapiens 
<220> 

<221> UNSURE 

<222> (794) . . . (794) 

<223> UNKNOWN 



<400> 24 



Trp 


Gin 


His 


Gly 


Glu 


Glu 


Ser 


Met 


Ala 


Lys 


Ala 


Leu 


Val 


Ala 


Cys 


Lys 


1 








5 










10 










15 




lie 


Tyr 


Arg 


Ser 


Met 


Ala 


Tyr 


Glu 


Ala 


Lys 


Gin 


Ser 


Asp 


Leu 


Val 


Asp 






20 










25 










30 






Asp 


Thr 


Ser 
35 


Glu 


Glu 


Leu 


Lys 


Gin 
40 


Tyr 


Ser 


Asn 


Asp 


Phe 
45 


Gly 


Gin 


Leu 


Ala 


Val 
50 


Glu 


Leu 


Leu 


Glu 


Gin 
55 


Ser 


Phe 


Arg 


Gin 


Asp 
60 


Glu 


Thr 


Met 


Ala 


Met 


Lys 


Leu 


Leu 


Thr 


Tyr 


Glu 


Leu 


Lys 


Asn 


Trp 


Ser 


Asn 


Ser 


Thr 


Cys 


65 








70 










75 










80 


Leu 


Lys 


Leu 


Ala 


Val 


Ser 


Ser 


Arg 


Leu 


Arg 


Pro 


Phe 


Val 


Ala 


His 


Thr 








85 










90 










95 
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Cys 


Thr 


Gin 


Met 


Leu 


Leu 


Ser 


Asp 


Met 


Trp 


Met 


Gly 


Arg 


Leu 


Asn 


Met 






100 










105 










110 






Arg 


Lys 


Asn 


Ser 


Trp 


Tyr 


Lys 


Val 


He 


Leu 


Ser 


He 


Leu 


Val 


Pro 


Pro 




115 










120 










125 








Ala 


He 


Leu 


Leu 


Leu 


Glu 


Tyr 


Lys 


Thr 


Lys 


Ala 


Glu 


Met 


Ser 


His 


He 




130 










135 










140 










Pro 


Gin 


Ser 


Gin 


Asp 


Ala 


His 


Gin 


Met 


Thr 


Met 


Asp 


Asp 


Ser 


Glu 


Asn 


145 










150 










155 










160 


Asn 


Phe 


Gin 


Asn 


He 


Thr 


Glu 


Glu 


He 


Pro 


Met 


Glu 


Val 


Phe 


Lys 


Glu 










165 










170 










175 




Val 


Arg 


He 


Leu 


Asp 


Ser 


Asn 


Glu 


Gly 


Lys 


Asn 


Glu 


Met 


Glu 


He 


Gin 








180 










185 










190 






Met 


Lys 


Ser 


Lys 


Lys 


Leu 


Pro 


He 


Thr 


Arg 


Lys 


Phe 


Tyr 


Ala 


Phe 


Tyr 






195 










200 










205 








His 


Ala 


Pro 


He 


Val 


Lys 


Phe 


Trp 


Phe 


Asn 


Thr 


Leu 


Ala 


Tyr 


Leu 


Gly 




210 










215 










220 










Phe 


Leu 


Met 


Leu 


Tyr 


Thr 


Phe 


Val 


Val 


Leu 


Val 


Gin 


Met 


Glu 


Gin 


Leu 


225 








230 










235 










240 


Pro 


Ser 


Val 


Gin 


Glu 


Trp 


He 


Val 


He 


Ala 


Tyr 


He 


Phe 


Thr 


Tyr 


Ala 










245 










250 










255 




He 


Glu 


Lys 


Val 


Arg 


Glu 


He 


Phe 


Met 


Ser 


Glu 


Ala 


Gly 


Lys 


Val 


Asn 






260 










265 










270 






Gin 


Lys 


He 


Lys 


Val 


Trp 


Phe 


Ser 


Asp 


Tyr 


Phe 


Asn 


He 


Ser 


Asp 


Thr 




275 










280 










285 








He 


Ala 


He 


He 


Ser 


Phe 


Phe 


He 


Gly 


Phe 


Gly 


Leu 


Arg 


Phe 


Gly Ala 




290 










295 










300 










Lys 


Trp 


Asn 


Phe 


Ala 


Asn 


Ala 


Tyr 


Asp 


Asn 


His 


Val 


Phe 


Val 


Ala 


Gly 


305 










310 










315 










320 


Arg 


Leu 


He 


Tyr 


Cys 


Leu 


Asn 


He 


He 


Phe 


Trp 


Tyr 


Val 


Arg 


Leu 


Leu 








325 










330 










335 




Asp 


Phe 


Leu 


Ala 


Val 


Asn 


Gin 


Gin 


Ala 


Gly 


Pro 


Tyr 


Val 


Met 


Met 


He 






340 










345 










350 






Gly 


Lys 


Met 


Val 


Ala 


Asn 


Met 


Phe 


Tyr 


He 


Val 


Val 


He 


Met 


Ala 


Leu 


355 










360 










365 








Val 


Leu 


Leu 


Ser 


Phe 


Gly 


Val 


Pro 


Arg 


Lys 


Ala 


He 


Leu 


Tyr 


Pro 


His 




370 










375 










380 










Glu 


Ala 


Pro 


Ser 


Trp 


Thr 


Leu 


Ala 


Lys 


Asp 


He 


Val 


Phe 


His 


Pro 


Tyr 


385 










390 










395 










400 


Trp 


Met 


He 


Phe 


Gly 


Glu 


Val 


Tyr 


Ala 


Tyr 


Glu 


He 


Asp 


Val 


Cys 


Ala 










405 










410 










415 




Asn 


Asp 


Ser 


Val 


He 


Pro 


Gin 


He 


Cys 


Gly 


Pro 


Gly 


Thr 


Trp 


Leu 


Thr 








420 










425 










430 






Pro 


Phe 


Leu 


Gin 


Ala 


Val 


Tyr 


Leu 


Phe 


Val 


Gin 


Tyr 


He 


He 


Met 


Val 






435 










440 










445 








Asn 


Leu 


Leu 


He 


Ala 


Phe 


Phe 


Asn 


Asn 


Val 


Tyr 


Leu 


Gin 


Val 


Lys 


Ala 




450 










455 










460 










He 


Ser 


Asn 


He 


Val 


Trp 


Lys 


Tyr 


Gin 


Arg 


Tyr 


His 


Phe 


He 


Met 


Ala 


465 










470 










475 










480 


Tyr 


His 


Glu 


Lys 


Pro 


Val 


Leu 


Pro 


Pro 


Pro 


Leu 


He 


He 


Leu 


Ser 


His 






485 










490 










4 95 




lie 


Val 


Ser 


Leu 


Phe 


Cys 


Cys 


He 


Cys 


Lys 


Arg 


Arg 


Lys 


Lys 


Asp 


Lys 








500 










505 










510 






Thr 


Ser 


Asp 


Gly 


Pro 


Lys 


Leu 


Phe 


Leu 


Thr 


Glu 


Glu 


Asp 


Gin 


Lys 


Lys 






515 










520 










525 








Leu 


His 


Asp 


Phe 


Glu 


Glu 


Gin 


Cys 


val 


Glu 


Met 


Tyr 


Phe 


Asn 


Glu 


Lys 




530 








535 










540 










Asp 


Asp 


Lys 


Phe 


His 


Ser 


Gly 


Ser 


Glu 


Glu 


Arg 


He 


Arg 


Val 


Thr 


Phe 


545 










550 










555 










560 


Glu 


Arg 


Val 


Glu 


Gin 


Met 


Cys 


He 


Gin 


He 


Lys 


Glu 


Val 


Gly Asp 


Arg 








565 










570 










575 
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Val 


Asn 


Tvr 


lie 


Lvs 


Arg 


Ser 


Leu 


Gin 


Ser 


Leu 


Asp 


Ser 


Gin 


He 


Gly 






580 










585 










590 






His 


Leu 


Gin 


Asp 


Leu 


Ser 


Ala 


Leu 


Thr 


Val 


Asp 


Thr 


Leu 


Lys 


Thr 


Leu 






595 








600 










605 








Thr 


Ala 


Gin 


Lys 


Ala 


Ser 


Glu 


Ala 


Ser 


Lys 


Val 


His 


Asn 


Glu 


He 


Thr 




O J. \i 








615 










620 










Arg 


Glu 


Leu 


Ser 


He 


Ser 


Lys 


His 


Leu 


Ala 


Gin 


Asn 


Leu 


He 


Asp 


ASD 


625 










630 










635 










640 


V3j.y 


Pro 


Va 1 

Vo J. 


Arg 


Pro 


OC7X 


Val 


i Xi-» 


Lys 


Lys 


His 


Gly Val 


Val 


Asn 


Thr 










645 










650 










655 




Li6U 


Ser 


Ser 


Ser 


Leu 


Pro 


Gin 


Gly 


Asp 


Leu 


Glu 


Ser 


Asn 


Asn 


Pro 


Phe 








660 










665 










670 






nXS 


Cys 


Asn 




Leu 


Meat- 


ys 


Asp 


AsD 


Lys 


Asp 


Pro 


vjxn 


Cys 


Asn 


He 






u / ^ 










680 










boD 








iriie 


ijp j.y 


r: 1 n 


Asp 


Xjeu 


Pro 


Ala 


Val 


Pro 


Gin 


Arg 


Lys 


vjXU 


Phe 


Asn 


Phe 




690 










695 










T n o 










Pro 




AX a 


vaxy 


Ser 


Ser 


Ser 


Gly 


Ala 


Leu 


Phe 


Pro 


oer 


Ala 


Val 


Ser 


/ w 3 








710 










715 










720 


irJ. O 


fro 


V3±U 


Leu 


Arg 


uxn 


rg 


Leu 


His 


Gly 


Val 


oXU 


Leu 


Leu 


Lys 


He 










725 










730 










735 




irne 


Asn 


Lys 


Asn 




Lys 


Leu 


Gly 


Ser 


Ser 


Ser 


1 nr 


oer 


He 


Pro 


His 






74 0 










745 










750 






Leu 


Ser 


Ser 


Pr-n 
IT X vJ 


Pro 


Thr 


ys 


Phe 


Phe 


Val 


Ser 


1 nz. 


Pro 


Ser 


Gin 


Pro 
















7 60 










*7 
/ 








Ser 


Cys 


Lys 


X 


His 


L u 
eu 


Glu 


Thr 


Gly 


Thr 


Lys 


Asp 


oxn 


Glu 


Thr 


Val 




770 










775 










"7 ft n 










Cys 


Ser 


Lys 


AX a 


1 IIX 


ox u 


Gly 


Asp 


Asn 


Xaa 


Glu 


fne 


taxy 


Ala 


Phe 


Val 


785 










790 










795 










800 


Gly 


His 


Arg 


Asp 


Ser 


Metz 


Asp 


Leu 


Gin 


Arg 


Phe 


Lys 


uXU 


Thr 


Ser 


Asn 










805 










810 










815 




Lys 


He 


Jjys 


X xc> 




Ser 


Asn 


Asn 


Asn 


Thr 


Ser 


OXU 


Asn 


Thr 


Leu 


Lvs 






820 










825 










830 






Arg Val 


Q AT* 

OCX 


Ser 


Leu 


Ala 


Gly 


Phe 


Thr 


Asp 


Cys 


IlXS 


Arg 


Thr 


Ser 


He 






835 










840 










845 








Pro 


Val 


His 


Ser 


Lys 


Gin 


Ala 


Glu 


Lys 


He 


Ser 


Arg 


Arg 


Pro 


Ser 


Thr 




850 










855 










860 










Glu 


Asp 


Thr 


His 


Glu 


Val 


Asp 


Ser 


Lys 


Ala 


Ala 


Leu 


He 


Pro 


ASD 


Tro 


865 










870 










875 










880 


Leu 


Gin 


sp 


Arg 


Pro 


Ser 


Asn 


/AX y 


Glu 


Met 


Pro 


Ser 


Glu 


Glu 


Glv 


Thr 










885 










890 










895 




Leu 


Asn 


Gly 


Leu 


Thr 


Ser 


Pro 


Phe 


Lys 


Pro 


Ala 


Met 


Asp 


Thr 


Asn 


Tvr 








900 










905 










910 






Tyr 


Tyr 


Ser 


Ala 


Val 


Glu 


Arg 


Asn 


Asn 


Leu 


Met 


Arg 


Leu 


Ser 


Gin 


Ser 






915 










920 










925 








lie 


Pro 


Phe 


Thr 


Pro 


Val 


Pro 


Pro 


Arg 


Glv 


Glu 


Pro 


Val 


Thr 


Val 


Tyr 




930 










935 










940 










Arg 


Leu 


Glu 


Glu 


Ser 


Ser 


Pro 


Asn 


He 


Leu 


Asn 


Asn 


Ser 


Met 


Ser 


Ser 


94 5 










950 










955 










960 


Trp 


Ser 


Gin 


Leu 


Gly 


Leu 


Cys 


Ala 


Lys 


He 


Glu 


Phe 


Leu 


Ser 


Lys 


Glu 










965 










970 










975 




Glu 


Met 


Gly 


Gly 


Gly 


Leu 


Arg 


Arg 


Ala 


Val 


Lys 


Val 


Gin 


Cys 


Thr 


Trp 








980 










985 










990 






Ser 


Glu 


His 


Asp 


He 


Leu 


Lys 


Ser 


Gly 


His 


Leu 


Tyr 


He 


He 


Lys 


Ser 






995 










1000 








1005 






Phe 


Leu 


Pro 


Glu 


Val 


Val 


Asn 


Thr 


Trp 


Ser 


Ser 


He 


Tyr 


Lys 


Glu 


Asp 




1010 








1015 








1020 








Thr 


Val 


Leu 


His 


Leu 


Cys 


Leu 


Arg 


Glu 


He 


Gin 


Gin 


Gin 


Arg 


Ala 


Ala 


1025 








1030 








1035 








104 


Gin 


Lys 


Leu 


Thr 


Phe 


Ala 


Phe 


Asn 


Gin 


Met 


Lys 


Pro 


Lys 


Ser 


He 


Pro 








1045 








1050 








1055 
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Tyr Ser Pro Arg Phe Leu Glu Val Phe Leu Leu Tyr Cys His Ser Ala 

1060 1065 1070 

Gly Gin Trp Phe Ala Val Glu Glu Cys Met Thr Gly Glu Phe Arg Lys 

1075 1080 1085 

Tyr Asn Asn Asn Asn Gly Asp Glu lie lie Pro Thr Asn Thr Leu Glu 

1090 1095 1100 

Glu lie Met Leu Ala Phe Ser His Trp Thr Tyr Glu Tyr Thr Arg Gly 
1105 1110 1115 112 

Glu Leu Leu Val Leu Asp Leu Gin Gly Val Gly Glu Asn Leu Thr Asp 

1125 1130 1135 

Pro Ser Val lie Lys Ala Glu Glu Lys Arg Ser Cys Asp Met Val Phe 

1140 1145 1150 

Gly Pro Ala Asn Leu Gly Glu Asp Ala lie Lys Asn Phe Arg Ala Lys 

1155 1160 1165 

His His Cys Asn Ser Cys Cys Arg Lys Leu Lys Leu Pro Asp Leu Lys 

1170 1175 1180 

Arg Asn Asp Tyr Thr Pro Asp Lys lie He Phe Pro Gin Asp Glu Pro 
1185 1190 1195 120 

Ser Asp Leu Asn Leu Gin Pro Gly Asn Ser Thr Lys Glu Ser Glu Ser 

1205 1210 1215 

Ala Asn Ser Val Arg Leu Met Leu 
1220 



<210> 25 

<211> 2180 

<212> DNA 

<213> Homo Sapiens 



<400> 25 

tcgaggccaa gaattcggca cgagggcctc gggcaggccc cctggagcga cctgcttctt 60 

tgggcactgt tgctgaacag ggcacagatg gccatgtact tctgggagat gggttccaat 120 

gcagtttcct cagctcttgg ggcctgtttg ctgctccggg tgatggcacg cctggagcct 180 

gacgctgagg aggcagcacg gaggaaagac ctggcgttca agtttgaggg gatgggcgtt 240 

gacctctttg gcgagtgcta tcgcagcagt gaggtgaggg ctgcccgcct cctcctccgt 300 

cgctgcccgc tctgggggga tgccacttgc ctccagctgg ccatgcaagc tgacgcccgt 360 

gccttctttg cccaggatgg ggtacagtct ctgctgacac agaagtggtg gggagatatg 4 20 

gccagcacta cacccatctg ggccctggtt ctcgccttct tttgccctcc actcatctac 480 

acccgcctca tcaccttcag gaaatcagaa gaggagccca cacgggagga gctagagttt 540 

gacatggata gtgtcattaa tggggaaggg cctgtcggga cggcggaccc agccgagaag 600 

acgccgctgg gggtcccgcg ccagtcgggc cgtccgggtt gctgcggggg ccgctgcggg 660 

gggcgccggt gcctacgccg ctggttccac ttctggggcg cgccggtgac catcttcatg 720 

ggcaacgtgg tcagctacct gctgttcctg ctgcttttct cgcgggtgct gctcgtggat 780 

ttccagccgg cgccgcccgg ctccctggag ctgctgctct atttctgggc tttcacgctg 840 

ctgtgcgagg aactgcgcca gggcctgagc ggaggcgggg gcagcctcgc cagcgggggc 900 

cccgggcctg gccatgcctc actgagccag cgcctgcgcc tctacctcgc cgacagctgg 960 

aaccagtgcg acctagtggc tctcacctgc ttcctcctgg gcgtgggctg ccggctgacc 1020 

ccgggtttgt accacctggg ccgcactgtc ctctgcatcg acttcatggt tttcacggtg 1080 

cggctgcttc acatcttcac ggtcaacaaa cagctggggc ccaagatcgt catcgtgagc 1140 

aagatgatga aggacgtgtt cttcttcctc ttcttcctcg gcgtgtggct ggtagcctat 1200 

ggcgtggcca cggaggggct cctgaggcca cgggacagtg acttcccaag tatcctgcgc 12 60 

cgcgtcttct accgtcccta cctgcagatc ttcgggcaga ttccccagga ggacatggac 132 0 

gtggccctca tggagcacag caactgctcg tcggagcccg gcttctgggc acaccctcct 138 0 

ggggcccagg cgggcacctg cgtctcccag tatgccaact ggctggtggt gctgctcctc 14 4 0 

gtcatcttcc tgctcgtggc caacatcctg ctggtcaact tgctcattgc catgttcagt 1500 

tacacattcg gcaaagtaca gggcaacagc gatctctact ggaaggcgca gcgttaccgc 1560 

ctcatccggg aattccactc tcggcccgcg ctggccccgc cctttatcgt catctcccac 1620 

ttgcgcctcc tgctcaggca attgtgcagg cgacccsgga gcccccagcc gtcctccccg 1680 

gccctcgagc atttccgggt ttacctttct aaggaagccg agcggaagct gctaacgtgg 174 0 

gaatcggtgc ataaggagaa ctttctgctg gcacgcgcta gggacaagcg ggagagcgac 1800 

tccgagmgtc tgaagcgcac gtcccagaag gtggacttgg cactgaaaca gctgggacac 1860 
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atccgcgagc acgaacagcg cctgaaagtg ctggagcggg aggtccagca gtgtacctcg 1920 

gcccccgcac ctggtggcct tgtccttgag gtgagcccca tgtccatctg ggccactgtc 1980 

aggaccacct ttgggagtgt catccttaca aaccacagca tgcccggctc ctcccagaac 204 0 

cagtcccagc ctgggaggat caaggcctgg atcccrggcc gttatccatc tggaggctgc 2100 

agggtccttg gggtaacagg gaccacagac ccctcaccac tcacagattc ctcacactgg 2160 

ggaaataaag ccatttcaga 2180 

<210> 26 
<211> 725 
<212> PRT 

<213> Homo Sapiens 



<220> 

<221> UNSURE 

<222> (553) . . . (553) 

<223> UNKNOWN 

<221> UNSURE 

<222> (603) . . . (603) 

<223> UNKNOWN 





<400> 


26 


Ser 


Arg 


Pro 


Arg 


1 








Asp 


Leu 


Leu 


Leu 








20 


Xyr 


Phe 


Trp 


i3j.U 






35 




Cys 


Leu 


Leu 


Leu 




50 






Ala 


Ala 


Arg 


Arg 


65 








Asp 


Leu 


Phe 


Gly 


Leu 


Leu 


Leu 


Arg 








100 


Leu 


Ala 


Met 


Gin 






115 




Gin 


Ser 


Leu 


Leu 




130 






Pro 


lie 


Trp 


Ala 


145 








Thr 


Arg 


Leu 


lie 


Glu 


Leu 


Glu 


Phe 








180 


Gly 


Thr 


Ala 


Asp 






195 




Ser 


Gly 


Arg 


Pro 




210 






Leu 


Arg 


Arg 


Trp 


225 








Gly 


Asn 


Val 


Val 


Leu 


Leu 


Val 


Asp 








260 


Leu 


Tyr 


Phe 


Trp 






275 




Leu 


Ser 


Gly 


Gly 




290 







lie Arg His Glu 
5 

Trp Ala Leu Leu 

Met Gly Ser Asn 
40 

Arg Val Met Ala 
55 

Lys Asp Leu Ala 
70 

Glu Cys Tyr Arg 
85 

Arg Cys Pro Leu 

Ala Asp Ala Arg 
120 

Thr Gin Lys Trp 
135 

Leu Val Leu Ala 
150 

Thr Phe Arg Lys 
165 

Asp Met Asp Ser 

Pro Ala Glu Lys 
200 

Gly Cys Cys Gly 
215 

Phe His Phe Trp 
230 

Ser Tyr Leu Leu 
245 

Phe Gin Pro Ala 

Ala Phe Thr Leu 
280 

Gly Gly Ser Leu 
295 



Gly Leu Gly Gin 
10 

Leu Asn Arg Ala 
25 

Ala Val Ser Ser 

Arg Leu Glu Pro 
60 

Phe Lys Phe Glu 
75 

Ser Ser Glu Val 
90 

Trp Gly Asp Ala 
105 

Ala Phe Phe Ala 

Trp Gly Asp Met 
140 

Phe Phe Cys Pro 
155 

Ser Glu Glu Glu 
170 

Val lie Asn Gly 
185 

Thr Pro Leu Gly 

Gly Arg Cys Gly 
220 

Gly Ala Pro Val 
235 

Phe Leu Leu Leu 
250 

Pro Pro Gly Ser 
265 

Leu Cys Glu Glu 

Ala Ser Gly Gly 
300 



Ala Pro Trp Ser 
15 

Gin Met Ala Met 
30 

Ala Leu Gly Ala 
45 

Asp Ala Glu Glu 

Gly Met Gly Val 
80 

Arg Ala Ala Arg 
95 

Thr Cys Leu Gin 
110 

Gin Asp Gly Val 
125 

Ala Ser Thr Thr 

Pro Leu lie Tyr 
160 

Pro Thr Arg Glu 
175 

Glu Gly Pro Val 
190 

Val Pro Arg Gin 
205 

Gly Arg Arg Cys 

Thr He Phe Met 
240 

Phe Ser Arg Val 
255 

Leu Glu Leu Leu 
270 

Leu Arg Gin Gly 
285 

Pro Gly Pro Gly 
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His 


Ala 


Ser 


Leu 


Ser 


Gin 


Arg 


Leu 


Arg 


Leu 


Tyr 


Leu 


AX a 


Asp 


Ser 


Trp 


305 










310 










315 












Asn 


Gin 


Cys 


Asp 


Leu 


Val 


Ala 


Leu 


Thr 


Cys 


Phe 


Leu 


Leu 


Gly Val 










325 










o "3 n 










335 




Cys 


Arg 


Leu 


Thr 


Pro 


Gly 


Leu 


Tyr 


His 


Leu 


Gly Arg 


Thr 


Val 


Leu 


Cys 




340 










345 










350 






He 


Asp 


Phe 


Met 


Val 


Phe 


Thr 


Val 


Arg 


Leu 


Leu 


His 


T 1 

lie 


Phe 


Thr 


va± 




355 










OOU 










365 








Asn 


Lys 


Gin 


Leu 


Gly 


Pro 


Lys 


He 


Val 


He 


Val 


Ser 


Lys 


Met 


Met 


Lys 




370 










375 










380 










Asp 


Val 


Phe 


Phe 


Phe 


Leu 


Phe 


Phe 


Leu 


Gly 


Val 


Trp 


Leu 


Val 


Ala 


Tyr 


385 










390 










395 












Gly Val 


Ala 


Thr 


Glu 


Gly 


Leu 


Leu 


Arg 


Pro 


Arg 


Asp 


Ser 


Asp 


Phe 


Pro 










405 










410 










415 




Ser 


He 


Leu 


Arg 


Arg 


Val 


Phe 


Tyr 


Arg 


Pro 


Tyr 


Leu 


Gin 


He 


Phe 


o 1 » » 
Ksly 








420 










420 










430 






Gin 


He 


Pro 


Gin 


Glu 


Asp 


Met 


Asp 


Val 


Ala 


Leu 


Met 


Glu 


His 


Ser 


Asn 






435 










4 40 










445 








Cys 


Ser 


Ser 


Glu 


Pro 


Gly 


Phe 


Trp 


Ala 


His 


Pro 


Pro 


Gly Ala 


Gin 


AJia 


450 










4 55 










4 60 










Gly 


Thr 


Cys 


Val 


Ser 


Gin 


Tyr 


Ala 


Asn 


Trp 


Leu 


Val 


Val 


Leu 


Leu 


Leu 


465 








470 










475 










A an 


Val 


He 


Phe 


Leu 


Leu 


Val 


Ala 


Asn 


He 


Leu 


Leu 


Val 


Asn 


Leu 


Leu 


He 










485 










490 










4 95 




Ala 


Met 


Phe 


Ser 


Tyr 


Thr 


Phe 


Gly 


Lys 


Val 


Gin 


Gly Asn 


Ser 


Asp 


Lieu 








500 








505 










510 






Tyr 


Trp 


Lys 


Ala 


Gin 


Arg 


Tyr 


Arg 


Leu 


He 


Arg 


Glu 


Phe 


His 


Ser 


Arg 


515 










520 










525 








Pro 


Ala 


Leu 


Ala 


Pro 


Pro 


Phe 


He 


Val 


He 


Ser 


His 


Leu 


Arg 


Leu 


Leu 




530 










535 










540 










Leu 


Arg 


Gin 


Leu 


Cys 


Arg 


Arg 


Pro 


Xaa 


Ser 


Pro 


Gin 


Pro 


Ser 


Ser 


Pro 


545 








550 










555 










Z> bU 


Ala 


Leu 


Glu 


His 


Phe 


Arg 


Val 


Tyr 


Leu 


Ser 


Lys 


Glu 


Ala 


Glu 


Arg 


Lys 










565 










570 










575 




Leu 


Leu 


Thr 


Trp 


Glu 


Ser 


Val 


His 


Lys 


Glu 


Asn 


Phe 


Leu 


Leu 


Ala 


Arg 








580 










585 










590 






Ala 


Arg 


Asp 


Lys 


Arg 


Glu 


Ser 


Asp 


Ser 


Glu 


Xaa 


Leu 


Lys 


Arg 


Thr 


Ser 




595 










600 










605 








Gin 


Lys 


Val 


Asp 


Leu 


Ala 


Leu 


Lys 


Gin 


Leu 


Gly 


His 


He 


Arg 




Tyr 




610 










615 










620 










Glu 


Gin 


Arg 


Leu 


Lys 


Val 


Leu 


Glu 


Arg 


/—111 
GXU 


Val 


Gin 


Gin 


Cys 


Thr 


Ser 


625 


















635 










640 


Ala 


Pro 


Ala 


Pro 


Gly 


Gly 


Leu 


Val 


Leu 


Glu 


Val 


Ser 


Pro 


Met 


Ser 


He 










645 










650 










655 




Trp 


Ala 


Thr 


Val 


Arg 


Thr 


Thr 


Phe 


Gly 


Ser 


Val 


He 


Leu 


Thr 


Asn 


His 








660 










665 










670 






Ser 


Met 


Pro 


Gly 


Ser 


Ser 


Gin 


Asn 


Gin 


Ser 


Gin 


Pro 


Gly Arg 


He 


Lys 






675 








680 










685 








Ala 


Trp 


He 


Pro 


Gly 


Arg 


Tyr 


Pro 


Ser 


Gly 


Gly 


Cys 


Arg 


Val 


Leu 


Gly 




690 










695 










700 










Val 


Thr 


Gly 


Thr 


Thr 


Asp 


Pro 


Ser 


Pro 


Leu 


Thr 


Asp 


Ser 


Ser 


His 


Trp 


705 








710 










715 










720 


Gly Asn 


Lys 


Ala 


He 

























725 



<210> 27 

<211> 7419 

<212> DNA 

<213> Homo Sapiens 
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<400> 21 

cggggaccga tccagcctcc ggactctagc ctaggctttt gcaaaaagct atrtaggtga 60 

cactatagaa ggtacgcctg caggtaccgg tccggaattc ccgggtcgac ccacgcgtcc 120 

gcagccccgt cgccggcgga ggcgggcgcg ggcgcgtncc ctgtggccag tcacccggag 180 

gagttggtcg cacaattatg aaagactcgg ctuctgctgc tagcgccgga gctgagttag 240 

ttctgagaag gtttccctgg gcgttccttg tccggcggcc tctgctgccg cctccggaga 300 

cgcttcccga tagatggcta caggccgcgg aggaggagga ggtggagttg ctgcccttcc 360 

ggagtccgcc ccgtgaggag aatgtcccag aaatcctgga tagaaagcac tttgaccaag 420 

agggaatgtg tatatattat accaagttcc aaggaccctc acagatgcct tccaggatgt 480 

caaatttgtc agcaactcgt caggtgtttt tgtggtcgct tggtcaagca acacgcttgt 540 

tttactgcaa gtcttgccat gaaatactca gatgtgaaat tgggtgacca ttttaatcag 600 

gcaatagaag aatggtctgt ggaaaagcat acagaacaga gcccaacgga tgcttatgga 660 

gtcataaatt ttcaaggggg ttctcattcc tacagagcta agtatgtgag gctatcatat 720 

gacaccaaac ctgaagtcat tctgcaactt ctgcttaaag aatggcaaat ggagttaccc 780 

aaacttgtta tctctgtaca tgggggcatg cagaaatttg agcttcaccc acgaatcaag 840 

cagttgcttg gaaaaggtct tattaaagct gcagttacaa ctggagcctg gattttaact 900 

ggaggagtaa acacaggtgt ggcaaaacat gttggagatg ccctcaaaga acatgcttcc 960 

agatcatctc gaaagatttg cactatcgga atagctccat ggggagtgat tgaaaacaga 1020 

aatgatcttg ttgggagaga tgtggttgct ccttatcaaa ccttattgaa ccccctgagc 1080 

aaattgaatg ttttgaataa tctgcattcc catttcatat tggtggatga tggcactgtt 1140 

ggaaagtatg gggcggaagt cagactgaga agagaacttg aaaaaactat taatcagcaa 1200 

agaattcatg ctaggattgg ccagggtgtc cctgtggtgg cacttatatt tgagggtggg 1260 

ccaaatgtta tcctcacagt tcttgaatac cttcaggaaa gcccccctgt tccagtagtt 1320 

gtgtgtgaag gaacaggcag agctgcagat ctgctagcgt atattcataa acaaacagaa 1380 

gaaggaggga atcttcctga tgcagcagag cccgatatta tttccactat caaaaaaaca 1440 

tttaactttg gccagaatga agcacttcat ttatttcaaa cactgatgga gtgcatgaaa 1500 

agaaaggagc ttatcactgt tttccatatt gggtcagatg aacatcaaga tatagatgta 1560 

gcaatactta ctgcactgct aaaaggtact aatgcatctg catttgacca gcttatcctt 1620 

acattggcat gggatagagt tgacattgcc aaaaatcatg tatttgttta tggacagcag 168 0 

tggctggttg gatccttgga acaagctatg cttgatgctc ttgtaatgga tagagttgca 1740 

tttgtaaaac ttcttattga aaatggagta agcatgcata aattccttac cattccgaga 1800 

ctggaagaac tttacaacac taaacaaggt ccaactaatc caatgctgtt tcatcttgtt 1860 

cgagacgtca aacagggaaa tcttcctcca ggatataaga tcactctgat tgatatagga 1920 

cttgttattg aatatctcat gggaggaacc tacagatgca cctatactag gaaacgtttt 1980 

cgattaatat ataatagtct tggtggaaat aatcggaggt ctggccgaaa tacctccagc 2040 

agcactcctc agttgcgaaa gagtcatgaa tcttttggca atagggcaga taaaaaggaa 2100 

aaaatgaggc ataaccattt cattaagaca gcacagccct tccgaccaaa gattgataca 2160 

gttatggaag aaggaaagaa gaaaagaacc aaagatgaaa ttgtagacat tgatgatcca 2220 

gaaaccaagc gctttcctta tccacttaat gaacttttaa tttgggcttg ccttatgaag 2280 

aggcaggtca tggcccgttt tttatggcaa catggtgaag aatcaatggc taaagcatta 2340 

gttgcctgta agatctatcg ttcaatggca tatgaagcaa agcagagtga cctggtagat 2400 

gatacttcag aagaactaaa acagtattcc aatgattttg gtcagttggc cgttgaatta 2460 

ttagaacagt ccttcagaca agatgaaacc atggctatga aattgctcac ttatgaactg 2520 

aagaactgga gtaattcaac ctgccttaag ttagcagttt cttcaagact tagacctttt 2580 

gtagctcaca cctgtacaca aatgttgtta tctgatatgt ggatgggaag gctgaatatg 2640 

aggaaaaatt cctggtacaa ggtcatacta agcattttag ttccacctgc catattgctg 2700 

ttagagtata aaactaaggc tgaaatgtcc catatcccac aatctcaaga tgctcatcag 2760 

atgacaatgg atgacagcga aaacaacttt cagaacataa cagaagagat ccccatggaa 2820 

gtgtttaaag aagtacggat tttggatagt aatgaaggaa agaatgagat ggagatacaa 2880 

atgaaatcaa aaaagcttcc aattacgcga aagttttatg ccttttatca tgcaccaatt 2940 

gtaaaattct ggtttaacac gttggcatat ttaggatttc tgatgcttta tacatttgtg 3000 

gttcttgtac aaatggaaca gttaccttca gttcaagaat ggattgttat tgcttatatt 3060 

tttacttatg ccattgagaa agtccgtgag atctttatgt ctgaagctgg gaaagtaaac 3120 

cagaagatta aagtatggtt tagtgattac ttcaacatca gtgatacaat tgccataatt 3180 

tctttcttca ttggatttgg actaagattt ggagcaaaat ggaactttgc aaatgcatat 324 0 

gataatcatg tttttgtggc tggaagatta atttactgtc ttaacataat attttggtat 3300 

gtgcgtttgc tagattttct agctgtaaat caacaggcag gaccttatgt aatgatgatt 3360 

ggaaaaatgg tggccaatat gttctacatt gtagtgatta tggctcttgt attacttagt 34 20 

tttggtgttc ccagaaaggc aatactttat cctcatgaag caccatcttg gactcttgct 34 80 

aaagatatag tttttcaccc atactggatg atttttggtg aagtttatgc atacgaaatt 3540 
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gatgtgtgtg caaatgattc tgttatccct caaatctgtg gtcctgggac gtggttgact 3600 

ccatttcttc aagcagtcta cctctttgta cagtatatca ttatggttaa tcttcttatt 3660 

gcatttttca acaatgtgta tttacaagtg aaggcaattt ccaatattgt atggaagtac 3720 

cagcgttatc attttattat ggcttatcat gagaaaccag ttctgcctcc tccacttatc 3780 

attcttagcc atatagtttc tctgttttgc tgcatatgta agagaagaaa gaaagataag 3840 

acttccgatg gaccaaaact tttcttaaca gaagaagatc aaaagaaact tcatgatttt 3900 

gaagagcagt gtgttgaaat gtatttcaat gaaaaagatg acaaatttca ttctgggagt 3960 

gaagagagaa ttcgtgtcac ttttgaaaga gtggaacaga tgtgcattca gattaaagaa 4020 

gttggagatc gtgtcaacta cataaaaaga tcattacaat cattagattc tcaaattggc 4080 

catttgcaag atctttcagc cctgacggta garacattaa aaacactcac tgcccagaaa 4140 

gcgtcggaag ctagcaaagt tcataatgaa arcacacgag aactgagcat ttccaaacac 4200 

ttggctcaaa accttattga tgatggtcct graagacctt ctgtatggaa aaagcatggt 4260 

gttgtaaata cacttagctc ctctcttcct caaggtgatc ttgaaagtaa taatcctttt 4320 

cattgtaata ttttaatgaa agatgacaaa ga-ccccagt gtaatatatt tggtcaagac 4380 

ttacctgcag taccccagag aaaagaattt aartttccag aggctggttc ctcttctggt 4440 

gccttattcc caagtgctgt ttcccctcca gaactgcgac agagactaca tggggtagaa 4 500 

ctcttaaaaa tatttaataa aaatcaaaaa ttaggcagtt catctactag cataccacat 4560 

ctgtcatccc caccaaccaa attttttgtt agtacaccat ctcagccaag ttgcaaaagc 4 620 

cacttggaaa ctggaaccaa agatcaagaa actgtttgct ctaaagctac agaaggagat 4 680 

aatacagaat ttggagcatt tgtaggacac agagatagca tggatttaca gaggtttaaa 4 74 0 

gaaacatcaa acaagataaa aatactatcc aataacaata cttctgaaaa cactttgaaa 4800 

cgagtgagtt ctcttgctgg atttactgac tgtcacagaa cttccattcc tgttcattca 48 60 

aaacaagcag aaaaaatcag tagaaggcca tcraccgaag acactcatga agtagattcc 4 920 

aaagcagctt taataccgga ttggttacaa gatagaccat caaacagaga aatgccatct 4 980 

gaagaaggaa cattaaatgg tctcacttct ccatttaagc cagctatgga tacaaattac 5040 

tattattcag ctgtggaaag aaataacttg atgaggttat cacagagcat tccatttaca 5100 

cctgtgcctc caagagggga gcctgtcaca gtgtatcgtt tggaagagag ttcacccaac 5160 

atactaaata acagcatgtc ttcttggtca caactaggcc tctgtgccaa aatagagttt 5220 

ttaagcaaag aggagatggg aggaggttta cgaagagctg tcaaagtaca gtgtacctgg 5280 

tcagaacatg atatcctcaa atcagggcat ctttatatta tcaaatcttt tcttccagag 5340 

gtggttaata catggtcaag tatttataaa gaagatacag ttctgcatct ctgtctgaga 5400 

gaaattcaac aacagagagc agcacaaaag cttacgtttg cctttaatca aatgaaaccc 54 60 

aaatccatac catattctcc aaggttcctt gaagttttcc tgctgtattg ccattcagca 5520 

ggacagtggt ttgctgtgga agaatgtatg actggagaat ttagaaaata caacaataat 5580 

aatggagatg agattattcc aactaatact ctggaagaga tcatgctagc ctttagccac 5640 

tggacttacg aatatacaag aggggagtta ctggtacttg atttgcaagg tgttggtgaa 5700 

aatttgactg acccatctgt gataaaagca gaagaaaaga gatcctgtga tatggttttt 5760 

ggcccagcaa atctaggaga agatgcaatt aaaaacttca gagcaaaaca tcactgtaat 5820 

tcttgctgta gaaagcttaa acttccagat ctgaagagga atgattatac gcctgataaa 5880 

attatatttc ctcaggatga gccttcagat ttgaatcttc agcctggaaa ttccaccaaa 5940 

gaatcagaat caactaattc tgttcgtctg atgttataat attaatatta ctgaatcatt 6000 

ggttttgcct gcacctcaca gaaatgttac tgtgtcactt ttccctcggg aggaaattgt 6060 

ttggtaatat agaaaggtgt atgcaagttg aatttgctga ctccagcaca gttaaaaggt 6120 

caatattctt ttgacctgat taatcagtca gaaagtccct ataggataga gctggcagct 6180 

gagaaatttt aaaggtaatt gataattagt atttgtaact ttttaaaggg ctctttgtat 6240 

agcagaggat ctcatttgac tttgttttga tgagggtgat gccctctctt atgtggtaca 6300 

ataccattaa ccaaaggtag gtgtccatgc agattttatt ggcagctgtt ttattgccat 6360 

tcaactaggg aaatgaagaa atcacgcagc cttttggtta aatggcagtc aaaattttcc 6420 

tcagtgtatt tagtgtgttc agtgatgata tcactggttc ccaactagat gcttgttggc 6480 

cacgggaagg gaaatgactt gttctaattc taggttcaca gaggtatgag aagcctgaac 6540 

tgaagaccat tttcaagagg gacggtattt atgaatcagg gttaggctcc atatttaaag 6600 

atagagccag tttttttttt aaatagaacc caaattgtgt aaaaatgtta attgggtttt 6660 

ttaaacattg ttttatcaag tcactgttaa gtagaagaaa gccatggtaa actgatacat 6720 

aacctaaatt ataaaagcag aaacctaact cactcgtcaa gggaagttac cttttgagga 67 80 

aagttaaagt acttttttcc ctatctgtat ctatagcaac aacccagaac ttacaaactt 68 4 0 

ctccaaagat tttattgatt gttatatcaa atcagaatgt aaacatgaac tcttgcatat 6900 

atttaaaatt gtgttggaac atttgaacat gaatgctgtt tgggtactta agaaattrat 6960 

tcagtnggat tatcattatg tganactggc agattgcagt gcanccttat gccaataaaa 7020 

tgtaatttar cagccccaga tattgttgaa tattcaacaa taacaagaaa agcttttcat 7080 

ctaagtttta tgctttaatt ttttttcttt ttttttcttt ttcttttgtt tccttggtac 7140 
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taattttaat ttttatttgg aagggagcag tataaagctt atttgtattt agtagtgtat 7200 

ctcatagata cagacaaggc aagagatgat aagctgttta aatagtgktt aatattgatt 7260 

ggg99tgggg agaaagaaaa agtgtattac ttaaagatac tatatacskt ttktatatca 7320 

ttaaatcttt aaaagaaatn naataaattt attgtttnca aaaaaaaaac ccnntaaaaa 7380 

aaaaagggcg gcccctctag aggatccctc gaggggccc 7419 
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Tyr His Ala Pro lie Val Lys Phe Trp Phe Asn Thr Leu Ala Tyr Leu 

850 855 860 

Gly Phe Leu Met Leu Tyr Thr Phe Val Val Leu Val Gin Met Glu Gin 
865 B70 875 880 

Leu Pro Ser Val Gin Glu Trp He Val He Ala Tyr He Phe Thr Tyr 

885 890 895 

Ala He Glu Lys Val Arg Glu He Phe Met Ser Glu Ala Gly Lys Val 

900 905 910 

Asn Gin Lys He Lys Val Trp Phe Ser Asp Tyr Phe Asn He Ser Asp 

915 920 925 

Thr He Ala He He Ser Phe Phe He Gly Phe Gly Leu Arg Phe Gly 

930 935 940 

Ala Lys Trp Asn Phe Ala Asn Ala Tyr Asp Asn His Val Phe Val Ala 
945 950 955 960 

Gly Arg Leu He Tyr Cys Leu Asn He He Phe Trp Tyr Val Arg Leu 

965 970 975 

Leu Asp Phe Leu Ala Val Asn Gin Gin Ala Gly Pro Tyr Val Met Met 

980 985 990 

He Gly Lys Met Val Ala Asn Met Phe Tyr He Val Val He Met Ala 

995 1000 1005 

Leu Val Leu Leu Ser Phe Gly Val Pro Arg Lys Ala He Leu Tyr Pro 

1010 1015 1020 

His Glu Ala Pro Ser Trp Thr Leu Ala Lys Asp He Val Phe His Pro 
1025 1030 1035 1040 

Tyr Trp Met He Phe Gly Glu Val Tyr Ala Tyr Glu He Asp Val Cys 

1045 1050 1055 

Ala Asn Asp Ser Val He Pro Gin He Cys Gly Pro Gly Thr Trp Leu 

1060 1065 1070 

Thr Pro Phe Leu Gin Ala Val Tyr Leu Phe Val Gin Tyr He He Met 

1075 1080 1085 

Val Asn Leu Leu He Ala Phe Phe Asn Asn Val Tyr Leu Gin Val Lys 

1090 1095 1100 

Ala He Ser Asn He Val Trp Lys Tyr Gin Arg Tyr His Phe He Met 
1105 1110 1115 1120 

Ala Tyr His Glu Lys Pro Val Leu Pro Pro Pro Leu He He Leu Ser 

1125 1130 1135 

His He Val Ser Leu Phe Cys Cys He Cys Lys Arg Arg Lys Lys Asp 

1140 1145 1150 

Lys Thr Ser Asp Gly Pro Lys Leu Phe Leu Thr Glu Glu Asp Gin Lys 

1155 1160 1165 

Lys Leu His Asp Phe Glu Glu Gin Cys Val Glu Met Tyr Phe Asn Glu 

1170 1175 1180 

Lys Asp Asp Lys Phe His Ser Gly Ser Glu Glu Arg He Arg Val Thr 
1185 1190 1195 1200 

Phe Glu Arg Val Glu Gin Met Cys He Gin He Lys Glu Val Gly Asp 

1205 1210 1215 

Arg Val Asn Tyr He Lys Arg Ser Leu Gin Ser Leu Asp Ser Gin He 

1220 1225 1230 

Gly His Leu Gin Asp Leu Ser Ala Leu Thr Val Asp Thr Leu Lys Thr 

1235 1240 1245 

Leu Thr Ala Gin Lys Ala Ser Glu Ala Ser Lys Val His Asn Glu He 

1250 1255 1260 

Thr Arg Glu Leu Ser He Ser Lys His Leu Ala Gin Asn Leu He Asp 
1265 1270 1275 1280 

Asp Gly Pro Val Arg Pro Ser Val Trp Lys Lys His Gly Val Val Asn 

1285 1290 1295 

Thr Leu Ser Ser Ser Leu Pro Gin Gly Asp Leu Glu Ser Asn Asn Pro 

1300 1305 1310 

Phe His Cys Asn He Leu Met Lys Asp Asp Lys Asp Pro Gin Cys Asn 

1315 1320 1325 
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Ile Phe Gly Gin Asp Leu Pro Ala Val Pro Gin Arg Lys Glu Phe Asn 

1330 1335 1340 

Phe Pro Glu Ala Gly Ser Ser Ser Gly Ala Leu Phe Pro Ser Ala Val 
1345 1350 1355 1360 

Ser Pro Pro Glu Leu Arg Gin Arg Leu His Gly Val Glu Leu Leu Lys 

1365 1370 1375 

lie Phe Asn Lys Asn Gin Lys Leu Gly Ser Ser Ser Thr Ser lie Pro 

1380 1385 1390 

His Leu Ser Ser Pro Pro Thr Lys Phe Phe Val Ser Thr Pro Ser Gin 

1395 1400 1405 

Pro Ser Cys Lys Ser His Leu Glu Thr Gly Thr Lys Asp Gin Glu Thr 

1410 1415 1420 

Val Cys Ser Lys Ala Thr Glu Gly Asp Asn Thr Glu Phe Gly Ala Phe 
1425 1430 1435 1440 

Val Gly His Arg Asp Ser Met Asp Leu Gin Arg Phe Lys Glu Thr Ser 

1445 1450 1455 

Asn Lys lie Lys lie Leu Ser Asn Asn Asn Thr Ser Glu Asn Thr Leu 

1460 1465 1470 

Lys Arg Val Ser Ser Leu Ala Gly Phe Thr Asp Cys His Arg Thr Ser 

1475 1480 1485 

lie Pro Val His Ser Lys Gin Ala Glu Lys lie Ser Arg Arg Pro Ser 

1490 1495 1500 

Thr Glu Asp Thr His Glu Val Asp Ser Lys Ala Ala Leu lie Pro Asp 
1505 1510 1515 1520 

Trp Leu Gin Asp Arg Pro Ser Asn Arg Glu Met Pro Ser Glu Glu Gly 

1525 1530 1535 

Thr Leu Asn Gly Leu Thr Ser Pro Phe Lys Pro Ala Met Asp Thr Asn 

1540 1545 1550 

Tyr Tyr Tyr Ser Ala Val Glu Arg Asn Asn Leu Met Arg Leu Ser Gin 

1555 1560 1565 

Ser lie Pro Phe Thr Pro Val Pro Pro Arg Gly Glu Pro Val Thr Val 

1570 1575 1580 

Tyr Arg Leu Glu Glu Ser Ser Pro Asn lie Leu Asn Asn Ser Met Ser 
1585 1590 1595 1600 

Ser Trp Ser Gin Leu Gly Leu Cys Ala Lys He Glu Phe Leu Ser Lys 

1605 1610 1615 

Glu Glu Met Gly Gly Gly Leu Arg Arg Ala Val Lys Val Gin Cys Thr 

1620 1625 1630 

Trp Ser Glu His Asp He Leu Lys Ser Gly His Leu Tyr He He Lys 

1635 1640 1645 

Ser Phe Leu Pro Glu Val Val Asn Thr Trp Ser Ser He Tyr Lys Glu 

1650 1655 1660 

Asp Thr Val Leu His Leu Cys Leu Arg Glu He Gin Gin Gin Arg Ala 
1665 1670 1675 1680 

Ala Gin Lys Leu Thr Phe Ala Phe Asn Gin Met Lys Pro Lys Ser He 

1685 1690 1695 

Pro Tyr Ser Pro Arg Phe Leu Glu Val Phe Leu Leu Tyr Cys His Ser 

1700 1705 1710 

Ala Gly Gin Trp Phe Ala Val Glu Glu Cys Met Thr Gly Glu Phe Arg 

1715 1720 1725 

Lys Tyr Asn Asn Asn Asn Gly Asp Glu He He Pro Thr Asn Thr Leu 

1730 1735 1740 

Glu Glu He Met Leu Ala Phe Ser His Trp Thr Tyr Glu Tyr Thr Arg 
1745 1750 1755 1760 

Gly Glu Leu Leu Val Leu Asp Leu Gin Gly Val Gly Glu Asn Leu Thr 

1765 1770 1775 

Asp Pro Ser Val He Lys Ala Glu Glu Lys Arg Ser Cys Asp Met Val 

1780 1785 1790 

Phe Gly Pro Ala Asn Leu Gly Glu Asp Ala He Lys Asn Phe Arg Ala 
1795 1800 1805 
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Lys His His Cys Asn Ser Cys Cys Arg Lys Leu Lys Leu Pro Asp Leu 

1810 1815 1820 

Lys Arg Asn Asp Tyr Thr Pro Asp Lys lie lie Phe Pro Gin Asp Glu 
1825 1830 1835 1840 

Pro Ser Asp Leu Asn Leu Gin Pro Gly Asn Ser Thr Lys Glu Ser Glu 

1845 1850 1855 

Ser Thr Asn Ser Val Arg Leu Met Leu 
1860 1865 

<210> 29 

<211> 4061 

<212> DNA 

<213> Homo Sapiens 

<400> 29 

ggtctggaag cagagccggc ggagggagcg ccggggccct gggctgcagg aggttgcggc 60 

ggccgcggca gcatggtggt gccggagaag gagcagagct ggatccccaa gatcttcaag 120 

aagaagacct gcacgacgtt catagttgac tccacagatc cgggagggac cttgtgccag 180 

tgtgggcgcc cccggaccgc ccaccccgca gtggccatgg aggatgcctt cggggcagcc 24 0 

gtggtgaccg tgtgggacag cgatgcacac accacggaga agcccaccga tgcctacgga 300 

gagctggact tcacgggggc cggccgcaag cacagcaatt tcctccggct ctctgaccga 360 

acggatccag ctgcagttta tagtctggtc acacgcacat ggggcttccg tgccccgaac 420 

ctggtggtgt cagtgctggg gggatcgggg ggccccgtcc tccagacctg gctgcaggac 4 80 

ctgctgcgtc gtgggctggt gcgggctgcc cagagcacag gagcctggat tgtcactggg 54 0 

ggtctgcaca cgggcatcgg ccggcatgtt ggtgtggctg tacgggacca tcagatggcc 600 

agcactgggg gcaccaaggt ggtggccatg ggtgtggccc cctggggtgt ggtccggaat 660 

agagacaccc tcatcaaccc caagggctcg ttccctgcga ggtaccggtg gcgcggtgac 720 

ccggaggacg gggtccagtt tcccctggac tacaactact cggccttctt cctggtggac 78 0 

gacggcacac acggctgcct ggggggcgag aaccgcttcc gcttgcgcct ggagtcctac 84 0 

atctcacagc agaagacggg cgtgggaggg actggaattg acatccctgt cctgctcctc 900 

ctgattgatg gtgatgagaa gatgttgacg cgaatagaga acgccaccca ggctcagctc 960 

ccatgtctcc tcgtggctgg ctcaggggga gctgcggact gcctggcgga gaccctggaa 1020 

gacactctgg ccccagggag tgggggagcc aggcaaggcg aagcccgaga tcgaatcagg 1080 

cgtttctttc ccaaagggga ccttgaggtc ctgcaggccc aggtggagag gattatgacc 114 0 

cggaaggagc tcctgacagt ctattcttct gaggatgggt ctgaggaatt cgagaccata 1200 

gttttgaagg cccttgtgaa ggcctgtggg agctcggagg cctcagccta cctggatgag 1260 

ctgcgtttgg ctgtggcttg gaaccgcgtg gacattgccc agagtgaact ctttcggggg 1320 

gacatccaat ggcggtcctt ccatctcgaa gcttccctca tggacgccct gctgaatgac 1380 

cggcctgagt tcgtgcgctt gctcatttcc cacggcctca gcctgggcca cttcctgacc 14 4 0 

ccgatgcgcc tggcccaact ctacagcgcg gcgccctcca actcgctcat ccgcaacctt 1500 

ttggaccagg cgtcccacag cgcaggcacc aaagccccag ccctaaaagg gggagctgcg 1560 

gagctccggc cccctgacgt ggggcatgtg ctgaggatgc tgctggggaa gatgtgcgcg 1620 

ccgaggtacc cctccggggg cgcctgggac cctcacccag gccagggctt cggggagagc 1680 

atgtatctgc tctcggacaa ggccacctcg ccgctctcgc tggatgctgg cctcgggcag 1740 

gccccctgga gcgacctgct tctttgggca ctgttgctga acagggcaca gatggccatg 1800 

tacttctggg agatgggttc caatgcagtt tcctcagctc ttggggcctg tttgctgctc 1860 

cgggtgatgg cacgcctgga gcctgacgct gaggaggcag cacggaggaa agacctggcg 1920 

ttcaagtttg aggggatggg cgttgacctc tttggcgagt gctatcgcag cagtgaggtg 1980 

agggctgccc gcctcctcct ccgtcgctgc ccgctctggg gggatgccac ttgcctccag 204 0 

ctggccatgc aagctgacgc ccgtgccttc tttgcccagg atggggtaca gtctctgctg 2100 

acacagaagt ggtggggaga tatggccagc actacaccca tctgggccct ggttctcgcc 2160 

ttcttttgcc ctccactcat ctacacccgc ctcatcacct tcaggaaatc agaagaggag 2220 

cccacacggg aggagctaga gtttgacatg gatagtgtca ttaatgggga agggcctgtc 2280 

gggacggcgg acccagccga gaagacgccg ctgggggtcc cgcgccagtc gggccgtccg 234 0 

ggttgctgcg ggggccgctg cggggggcgc cggtgcctac gccgctggtt ccacttctgg 24 00 

ggcgcgccgg tgaccatctt catgggcaac gtggtcagct acctgctgtt cctgctgctt 24 60 

ttctcgcggg tgctgctcgt ggatttccag ccggcgccgc ccggctccct ggagctgctg 2520 

ctctatttct gggctttcac gctgctgtgc gaggaactgc gccagggcct gagcggaggc 2580 

gggggcagcc tcgccagcgg gggccccggg cctggccatg cctcactgag ccagcgcctg 2640 

cgcctctacc tcgccgacag ctggaaccag tgcgacctag tggctctcac ctgcttcctc 2700 
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ctgggcgtgg 
atcgacttca 
gggcccaaga' 
ctcggcgtgt 
agtgacttcc 
cagattcccc 
cccggcttct 
aactggctgg 
aacttgctca 
tactggaagg 
ccgcccttta 
cggagccccc 
gccgagcgga 
gctagggaca 
ttggcactga 
cgggaggtcc 
gccttgctgc 
ctgctggcgg 
tctgggcctc 
gggccactgt 
cctcccagaa 
ctggaggctg 
cctcacactg 



gctgccggct 
tggttttcac 
tcgtcatcgt 
ggctggtagc 
caagtatcct 
aggaggacat 
gggcacaccc 
tggtgctgct 
ttgccatgtt 
cgcagcgt ta 
tcgtcatctc 
agccgtcctc 
agctgctaac 
agcgggagag 
aacagctggg 
agcagtgtag 
ccccaggtgg 
acttcaagga 
ggcccccgca 
caggaccacc 
ccagtcccag 
cagggtcctt 
gggaaataaa 



gaccccgggt 
ggtgcggctg 
gagcaagatg 
ctatggcgtg 
gcgccgcgtc 
ggacgtggcc 
tcctggggcc 
cctcgtcatc 
cagttacaca 
ccgcctcatc 
ccacttgcgc 
cccggccctc 
gtgggaatcg 
cgactccgag 
acacatccgc 
ccgcgtcctg 
gccgccaccc 
gaagccccca 
cctggtggcc 
tttgggagtg 
cctgggagga 
ggggtaacag 
gccatttcag 
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ttgtaccacc 
ctrcacatct 
atgaaggacg 
gccacggagg 
ttctaccgtc 
ctcatggagc 
caggcgggca 
ttcctgctcg 
ttcggcaaag 
cgggaattcc 
ctcctgctca 
gagcatttcc 
gtgcataagg 
cgtctgaagc 
gagtacgaac 
gggtgggtgg 
cctgacctgc 
caggggattt 
ttgtccttga 
tcatccttac 
tcaaggcctg 
ggaccacaga 
aggaaaaaaa 



tgggccgcac 
tcacggtcaa 
tgttcttctt 
ggctcctgag 
cctacctgca 
acagcaactg 
cctgcgtctc 
tggccaacat 
tacagggcaa 
actctcggcc 
ggcaattgtg 
gggtttacct 
agaactttct 
gcacgtccca 
agcgcctgaa 
ccgaggccct 
ctgggtccaa 
tgctcctaga 
ggtgagcccc 
aaaccacagc 
gatcccgggc 
cccctcacca 
a 



tgtcctctgc 
caaacagctg 
cctcttcttc 
gccacgggac 
gatcttcggg 
ctcgtcggag 
ccagtatgcc 
cctgctggtc 
cagcgatctc 
cgcgctggcc 
caggcgaccc 
ttctaaggaa 
gctggcacgc 
gaaggtggac 
agtgctggag 
gagccgctct 
agactgagcc 
gtaaggctca 
atgtccatct 
atgcccggct 
cgttatccat 
ctcacagatt 



2760 
2820 
2880 
2940 

3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4061 
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<212> 


PRT 




























<213> 


Homo Sapiens 






















<400> 


30 


























Met 


Val 


Val 


Pro 


Glu 


Lys 


Glu 


Gin 


Ser 


Trp 


He 


Pro 


Lys 


He 


Phe 


Lys 


1 








5 










10 










15 




Lys 


Lys 


Thr 


Cys 


Thr 


Thr 


Phe 


He 


Val 


Asp 


Ser 


Thr 


Asp 


Pro 


Gly 


Gly 




20 










25 










30 






Thr 


Leu 


Cys 


Gin 


Cys 


Gly 


Arg 


Pro 


Arg 


Thr 


Ala 


His 


Pro 


Ala 


Val 


Ala 






35 










40 










45 








Met 


Glu 


Asp 


Ala 


Phe 


Gly 


Ala 


Ala 


Val 


Val 


Thr 


Val 


Trp 


Asp 


Ser 


Asp 




50 










55 










60 










Ala 


His 


Thr 


Thr 


Glu 


Lys 


Pro 


Thr 


Asp 


Ala 


Tyr 


Gly 


Glu 


Leu 


Asp 


Phe 


65 










70 










75 










80 


Thr 


Gly 


Ala 


Gly 


Arg 


Lys 


His 


Ser 


Asn 


Phe 


Leu 


Arg 


Leu 


Ser 


Asp 


Arg 








85 










90 










95 




Thr 


Asp 


Pro 


Ala 


Ala 


Val 


Tyr 


Ser 


Leu 


Val 


Thr 


Arg 


Thr 


Trp 


Gly 


Phe 






100 










105 










110 






Arg 


Ala 


Pro 


Asn 


Leu 


Val 


Val 


Ser 


Val 


Leu 


Gly 


Gly 


Ser 


Gly 


Gly 


Pro 




115 










120 










125 








Val 


Leu 


Gin 


Thr 


Trp 


Leu 


Gin 


Asp 


Leu 


Leu 


Arg 


Arg 


Gly 


Leu 


Val 


Arg 




130 










135 










140 










Ala 


Ala 


Gin 


Ser 


Thr 


Gly 


Ala 


Trp 


He 


Val 


Thr 


Gly 


Gly 


Leu 


His 


Thr 


145 










150 










155 










160 


Gly 


He 


Gly 


Arg 


His 


Val 


Gly 


Val 


Ala 


Val 


Arg 


Asp 


His 


Gin 


Met 


Ala 




165 










170 










175 




Ser 


Thr 


Gly 


Gly 


Thr 


Lys 


Val 


Val 


Ala 


Met 


Gly 


Val 


Ala 


Pro 


Trp Gly 






180 










185 










190 






Val 


Val 


Arg 


Asn 


Arg 


Asp 


Thr 


Leu 


He 


Asn 


Pro 


Lys 


Gly 


Ser 


Phe 


Pro 






195 










200 










205 








Ala 


Arg 


Tyr 


Arg 


Trp 


Arg 


Gly 


Asp 


Pro 


Glu 


Asp 


Gly 


Val 


Gin 


Phe 


Pro 




210 










215 










220 










Leu 


Asp 


Tyr 


Asn 


Tyr 


Ser 


Ala 


Phe 


Phe 


Leu 


Val 


Asp 


Asp 


Gly 


Thr 


His 


225 






230 










235 










240 
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Gly 


Cys 


Leu 






Glu 




Arg 


Phe 


Arg 


Leu 


Arg 


Leu 


Glu 


Ser 


Tvr 






245 










250 










255 




T 1 d 


Ser 




260 


Lys 


Thr 


Gly 


Val 


Gly 
265 


Glv 


Thr 


Glv 


He 


Asp 
270 


He 


Pro 


\/^ 1 
vai 


Leu 


Leu 


Leu 


Leu 


He 


Asp 


Gly Asp 


Glu 


Lys 


Met 


Leu 


Thr 


Arg 


He 


























285 










Asn 
290 




THr* 


V3 J. 1 1 


Ala 


Gin 
295 


Leu 


fro 


Cys 


Leu 


Leu 
300 


Val 


Ala 


Glv 


Ser 


oJ.y 


oxy 


MX a 


Axa 


Asp 


Cys 


Leu 


TV T _ 

Ala 


Glu 


Thr 


Leu 


Glu 


Asp 


Thr 


Leu 


Ala 










310 










315 










320 


Pro 


Gly 


Ser 


VaXy 


ijxy 


Ala 


Arg 


Gin 


Gly 


O -L Li 


Ala 




Asp 


Arg 


He 


Arg 


















330 










335 




Ar9 


Phe 


pne 


fro 


i^ys 


Gly Asp 


Leu 


Glu 


v« 1 
Vox 


Leu 


Gin 


Ala 


Gin 


Val 


Glu 






340 










34 5 










350 






Arg 


lie 


Met 


Thr 


Arg 


Lys 


Glu 


Leu 


Leu 




Va 1 
vox 


iyr 


Ser 


Ser 


Glu 


Asp 




355 










360 










365 








Gly 


Ser 


Glu 


Glu 


Phe 


Glu 


Thr 


He 


Val 


J-itr Li 


Lys 


Ala 


Leu 


Val 


Lys 


Ala 


370 










375 




















Cys 


Gly 


Ser 


Ser 


Glu 


Ala 


Ser 


Ala 


Tyr 


Leu 


Asp 


O X Li 


Leu Arg 


Leu 


Ala 


385 








390 




















400 


Val 


Ala 


Trp 


Asn 


Arg 


Val 


Asp 


He 


Ala 


<jxn 


Ser 


VtXU 


Leu 


Phe 


Arg 


Gly 








405 










H XU 










415 




Asp 


He 


Gin 


Trp 


Arg 


Ser 


Phe 


His 


Leu 


VsXU 


nXa 


OCX 


Leu 


Met 


Asp 


Ala 






420 










425 










430 






Leu 


Leu 


Asn 


Asp Arg 


Pro 


Glu 


Phe 


Val 


Arg 


Leu 


Leu 


He 


Ser 


His 


Gly 






4 35 










440 










445 








Leu 


Ser 
450 


Leu 


Gly 


His 


Phe 


Leu 
455 


Thr 


Pro 




Arg 


Leu 


Ala 


Gin 


Leu 


Tyr 


Ser 


Ala 


Ala 


Pro 


Ser 


Asn 


Ser 


Leu 


lie 


Arg 


Asn 


Leu 


Leu Asp Gin Ala 


465 










470 




















480 


Ser 


His 


Ser 


Ala 


Gly Thr Lys 


Ala 


Pro 


nXa 


Leu 


Lys 


Gly 


Gly Ala 


Ala 










485 










H JKJ 










495 




Glu 


Leu 


Arg 


Pro 


Pro 


Asp 


Val 


laXy 


riXS 


V elX 


Leu 


Arg 


Met 


Leu 


Leu 


Gly 






500 










C A c 










510 






Lys 


Met 


Cys 


Ala 


Pro 


Arg 


Tyr 


Pro 


Ser 


\j J- y 


Gly 


Ala 


Trp 


Asp 


Pro 


His 




515 










520 










525 








Pro 


b±y 
s r» 


tjxn 


Gly 


Phe 


Gly 


Glu 
535 


Ser 


Met 




Leu 


Leu 
540 


Ser 


Asp 


Lys 


Ala 


i nr 


C Q r- 

oer 


Pro 


Leu 


Ser 


Leu 


Asp 


Ala 


Gly 


Leu 


Gly 


Gin 


Ala 


Pro 


Trp 


Ser 


545 










550 










555 










560 


Asp 


Leu 


Leu 


Leu 


Trp 


Ala 


Leu 


Leu 


Leu 


Asn 


Arg 


Ala 


Gin 


Met 


Ala 


Met 








565 










570 










575 




Tyr 


rTle 


Trp 


Glu 


Met 


Gly 


Ser 


Asn 


AX a 


Val 


Ser 


Ser 


Ala 


Leu 


Gly 


Ala 




580 










Do D 










590 






Cys 


Leu 


Leu 


Leu 


Arg 


Val 


Met 


Ala 


Arg 




Glu 


Pro 


Asp 


Ala 


Glu 


Glu 




595 










600 










605 








Pi. ±3. 


>vj.a 
610 


Arg 


Arg 


Lys 


Asp 


Leu 
615 


Ala 


Phe 


ys 


Phe 


Glu 
620 


Gly 


Met 


Gly 


Val 


Asp 


Leu 


irne 


Gly 


Glu 


Cys 


Tyr 


Arg 


Ser 


Ser 


Glu 


Val 


Arg 


Ala 


Ala 


Arg 


625 










630 










635 










640 


Leu 


Leu 


Leu 


Arg 


Arg 
645 


Cys 


Pro 


Leu 


Trp 


Gly 
650 


Asp 


Ala 


Thr 


Cys 


Leu 
655 


Gin 


Leu 


Ala 


Met 


Gin 


Ala 


Asp Ala 


Arg 


Ala 


Phe 


Phe 


Ala 


Gin 


Asp 


Gly 


Val 








660 










665 










67 0 






Gin 


Ser 


Leu 
675 


Leu 


Thr 


Gin 


Lys 


Trp 
680 


Trp 


Gly 


Asp 


Met 


Ala 
685 


Ser 


Thr 


Thr 


Pro 


He 


Trp 


Ala 


Leu 


Val 


Leu 


Ala 


Phe 


Phe 


Cys 


Pro 


Pro 


Leu 


He 


Tyr 




690 








695 










700 










Thr Arg 


Leu 


He 


Thr 


Phe 


Arg 


Lys 


Ser 


Glu 


Glu 


Glu 


Pro 


Thr 


Arg 


Glu 


705 










710 










715 










720 
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Glu 


Leu 


Glu 


Phe 


Asp 
725 


Met 


Asp 


Ser 


Val 


He 


Asn 


Gly 


Glu 


Gly 


Pro 

/ jD 


Val 


Gly 


Thr 


Ala 


Asp 


Pro 


Ala 


Glu 


Lys 


Thr 


Pro 


Leu 


Gly 


Val 


Pro 


Arg 


V3±n 






740 










745 










750 






Ser 


Gly 


Arg 
755 


Pro 


Gly 


Cys 


Cys 


Gly 
760 


Gly 


Arg 


Cys 


Gly 


Gly 
765 


Arg 


Arg 


Cys 


Leu 


Arg 
770 


Arg 


Trp 


Phe 


His 


Phe 
775 


Trp 


Gly 


Ala 


Pro 


Val 
780 


Thr 


He 


Phe 


Met 


Gly 


Asn 


Val 


Val 


Ser 


Tyr 


Leu 


Leu 


Phe 


Leu 


Leu 


Leu 


Phe 


Ser 


Arg 


Val 


785 










790 










795 










Q An 
oUU 


Leu 


Leu 


Val 


Asp 


Phe 


Gin 


Pro 


Ala 


Pro 


Pro 


Gly 


Ser 


Leu 


Glu 


Leu 


Leu 








805 










810 










815 




Leu 


Tyr 


Phe 


Trp 


Ala 


Phe 


Thr 


Leu 


Leu 


Cys 


Glu 


Glu 


Leu 


Arg 


Gin 


Gly 






820 










825 










830 






Leu 


Ser 


Gly 
835 


Gly 


Gly 


Gly 


Ser 


Leu 
840 


Ala 


Ser 


Gly 


Gly 


Pro 
845 


Gly 


Pro 


Gly 


His 


Ala 
850 


Ser 


Leu 


Ser 


Gin 


Arg 
855 


Leu 


Arg 


Leu 


Tyr 


Leu 
860 


Ala 


Asp 


Ser 


Trp 


Asn 


Gin 


Cys 


Asp 


Leu 


Val 


Ala 


Leu 


Thr 


Cys 


Phe 


Leu 


Leu 


Gly 


Val 


Gly 


865 










870 










875 










880 


Cys 


Arg 


Leu 


Thr 


Pro 


Gly 


Leu 


Tyr 


His 


Leu 


Gly Arg 


Thr 


Val 


Leu 


Cys 










885 










890 










895 




He 


Asp 


Phe 


Met 


Val 


Phe 


Thr 


Val 


Arg 


Leu 


Leu 


His 


He 


Phe 


Thr 


Val 






900 










905 










910 






Asn 


Lys 


Gin 


Leu 


Gly 


Pro 


Lys 


He 


Val 


He 


Val 


Ser 


Lys 


Met 


Met 


Lys 




915 










920 










925 








Asp 


Val 


Phe 


Phe 


Phe 


Leu 


Phe 


Phe 


Leu 


Gly Val 


Trp 


Leu 


Val 


Ala 


Tyr 


930 










935 










94 0 










Gly 


Val 


Ala 


Thr 


Glu 


Gly Leu 


Leu 


Arg 


Pro 


Arg 


Asp 


Ser 


Asp 


Phe 


Pro 


945 










950 










955 










960 


Ser 


He 


Leu 


Arg 


Arg 


Val 


Phe 


Tyr 


Arg 


Pro 


Tyr 


Leu 


Gin 


He 


Phe 


Gly 








965 










970 










975 




Gin 


He 


Pro 


Gin 
980 


Glu 


Asp 


Met 


Asp 


Val 
985 


Ala 


Leu 


Met 


Glu 


His 
990 


Ser 


Asn 


Cys 


Ser 


Ser 


Glu 


Pro 


Gly 


Phe 


Trp 


Ala 


His 


Pro 


Pro 


Gly 


Ala 


Gin 


Ala 




995 










1000 








1005 






Gly 


Thr 


Cys 


Val 


Ser 


Gin 


Tyr 


Ala 


Asn 


Trp 


Leu 


Val 


Val 


Leu 


Leu 


Leu 


1010 








1015 








1020 








Val 


He 


Phe 


Leu 


Leu 


Val 


Ala 


Asn 


He 


Leu 


Leu 


Val 


Asn 


Leu 


Leu 


He 


1025 








1030 








1035 








1040 


Ala 


Met 


Phe 


Ser 


Tyr 


Thr 


Phe 


Gly 


Lys 


Val 


Gin 


Gly 


Asn 


Ser 


Asp 


Leu 










1045 








1050 








1055 


Tyr 


Trp 


Lys 


Ala 


Gin 


Arg 


Tyr 


Arg 


Leu 


He 


Arg 


Glu 


Phe 


His 


Ser 


Arg 








1060 








1065 








1070 




Pro 


Ala 


Leu 


Ala 


Pro 


Pro 


Phe 


He 


Val 


He 


Ser 


His 


Leu 


Arg 


Leu 


Leu 



1075 1080 1085 



Leu Arg Gin Leu Cys Arg Arg Pro Arg Ser Pro Gin Pro Ser Ser Pro 

1090 1095 1100 

Ala Leu Glu His Phe Arg Val Tyr Leu Ser Lys Glu Ala Glu Arg Lys 
1105 1110 1115 1120 

Leu Leu Thr Trp Glu Ser Val His Lys Glu Asn Phe Leu Leu Ala Arg 

1125 1130 1135 

Ala Arg Asp Lys Arg Glu Ser Asp Ser Glu Arg Leu Lys Arg Thr Ser 

1140 1145 1150 

Gin Lys Val Asp Leu Ala Leu Lys Gin Leu Gly His He Arg Glu Tyr 

1155 1160 1165 

Glu Gin Arg Leu Lys Val Leu Glu Arg Glu Val Gin Gin Cys Ser Arg 

1170 1175 1180 

Val Leu Gly Trp Val Ala Glu Ala Leu Ser Arg Ser Ala Leu Leu Pro 
1185 1190 1195 1200 
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Pro Gly Gly Pro Pro Pro Pro Asp Leu Pro Gly Ser Lys Asp 
1205 1210 

<210> 31 

<2il> 4646 

<212> DNA 

<213> Homo Sapiens 



60 
120 
180 



<400> 31 

tcgacccacg cgtccgccca cgcgtccgcc cacgcgtccg cccacgcgtc cgcccacgcg 
tccgcccacg cgtccggggt gaaagmramy cmygcktsms aaaaaccgtc acttaggaaa 
agatgtcctt tcgggcagcc aggctcagca tgaggaacag aaggaatgac actctggaca 
gcacccggac cctgtactcc agcgcgtctc ggagcacaga cttgtcttac agrgaaagcg 240 
acttggtgaa ttttattcaa gcaaatttta agaaacgaga atgtgtcttc tttaccaaag 300 
attccaaggc cacggagaat gtgtgcaagt gtggctatgc ccagagccag cacatggaag 360 
gcacccagat caaccaaagt gagaaatgga actacaagaa acacaccaag gaatttccta 420 
ccgacgcctt tggggatatt cagtttgaga cactggggaa gaaagggaag tatatacgtc 480 
tgtcctgcga cacggacgcg gaaatccttt acgagctgct gacccagcac tggcacctga 
aaacacccaa cctggtcatt tctgtgaccg ggggcgccaa gaacttcgcc ctgaagccgc 
gcatgcgcaa gatcttcagc cggctcatct acatcgcgca gtccaaaggt gcttggattc 
tcacgggagg cacccattat ggcctgatga agtacatcgg ggaggtggtg agagataaca 
ccatcagcag gagttcagag gagaatattg tggccattgg catagcagct tggggcatgg 
tctccaaccg ggacaccctc atcaggaatt gcgatgctga gggctatttt ttagcccagt 
accttatgga tgacttcaca agagatccac tgtgtatcct ggacaacaac cacacacatt 
tgctgctcgt ggacaatggc tgtcatggac atcccactgt cgaagcaaag ctccggaatc 
agctagagaa gtatatctct gagcgcacta ttcaagattc caactatggt ggcaagatcc 
ccattgtgtg ttttgcccaa ggaggtggaa aagagacttt gaaagccatc aatacctcca 
tcaaaaataa aattccttgt gtggtggtgg aaggctcggg ccagatcgct gatgtgatcg 
ctagcctggt ggaggtggag gatgccctga catcttctgc cgtcaaggag aagctggtgc 
gctttttacc ccgcacggtg tcccggctgc ctgaggagga gactgagagt tggatcaaat 
ggctcaaaga aattctcgaa tgttctcacc tattaacagt tattaaaatg gaagaagctg 
gggatgaaat tgtgagcaat gccatctcct acgctctata caaagccttc agcaccagtg 
agcaagacaa ggataactgg aatgggcagc tgaagcttct gctggagtgg aaccagctgg 
acttagccaa tgatgagatt ttcaccaatg accgccgatg ggagtctgct gaccttcaag 
aagtcatgtt tacggctctc ataaaggaca gacccaagtt tgtccgcctc tttctggaga 
atggcttgaa cctacggaag tttctcaccc atgatgtcct cactgaactc ttctccaacc 
acttcagcac gcttgtgtac cggaatctgc agatcgccaa gaattcctat aatgatgccc 
tcctcacgtt tgtctggaaa ctggttgcga acttccgaag aggcttccgg aaggaagaca 
gaaatggccg ggacgagatg gacatagaac tccacgacgt gtctcctatt actcggcacc 
ccctgcaagc tctcttcatc tgggccattc ttcagaataa gaaggaactc tccaaagtca 
tttgggagca gaccaggggc tgcactctgg cagccctggg agccagcaag cttctgaaga 
ctctggccaa agtgaagaac gacatcaatg ctgctgggga gtccgaggag ctggctaatg 
agtacgagac ccgggctgtt gagctgttca ctgagtgtta cagcagcgat gaagacttgg 
cagaacagct gctggtctat tcctgtgaag cttggggtgg aagcaactgt ctggagctgg 
cggtggaggc cacagaccag catttcatcg cccagcctgg ggtccagaat tttctttcta 
agcaatggta tggagagatt tcccgagaca ccaagaactg gaagattatc ctgtgtctgt 
ttattatacc cttggtgggc tgtggctttg tatcatttag gaagaaacct gtcgacaagc 
acaagaagct gctttggtac tatgtggcgt tcttcacctc ccccttcgtg gtcttctcct 
ggaatgtggt cttctacatc gccttcctcc tgctgtttgc ctacgtgctg ctcatggatt 
tccattcggt gccacacccc cccgagctgg tcctgtactc gctggtcttt gtcctcttct 
gtgatgaagt gagacagtgg tacgtaaatg gggtgaatta ttttactgac ctgtggaatg 
tgatggacac gctggggctt ttttacttca tagcaggaat tgtatttcgg ctccactctt 
ctaataaaag ctctttgtat tctggacgag tcattttctg tctggactac attattttca 
ctctaagatt gatccacatt tttactgtaa gcagaaactt aggacccaag attataatgc 
tgcagaggat gctgatcgat gtgttcttct tcctgttcct ctttgcggtg tggatggtgg 
cctttggcgt ggccaggcaa gggatcctta ggcagaatga gcagcgctgg aggtggatat 
tccgttcggt catctacgag ccctacctgg ccatgttcgg ccaggtgccc agtgacgtgg 
atggtaccac gtatgacttt gcccactgca ccttcactgg gaatgagtcc aagccactgt 
gtgtggagct ggatgagcac aacctgcccc ggttccccga gtggatcacc atccccctgg 
tgtgcatcta catgttatcc accaacatcc tgctggtcaa cctgctggtc gccatgtttg 



540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
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gctacacggt 
tcctggtgca 
acttctacat 
cttctgtctg 
aggaaaacta 
atcgatttag 
ctaataaaat 
tcatattaag 
acccctgggt 
aagtgtgtga 
gtttctccag 
agtttaagtg 
tataggagaa 
ttttgttcat 
ttggcatatt 
ggaccccgct 
ggaggtgaca 
cccccattcc 
tttgttctta 
attatagaaa 
taaggtaatc 
tgtctcaatt 
gtaagctttt 
aacattatat 
tgaaggctat 
tatttgggaa 
aaaaaaaagg 



gggcaccgtc 
ggagtactgc 
ggtggtgaag 
ctgtttcaaa 
ccttgtcaag 
acaactggat 
caaataaaac 
gaatgctgat 
acatggtgga 
ttggtttcat 
aatggtgcct 
tgttcttacc 
catctatcct 
ttccaattga 
gtwaaaagtc 
ctcttcagga 
aggcagtctc 
taaatatgtg 
ttggatactc 
atttagacca 
actgctaaca 
ctctttcaaa 
tcacttagta 
caataattgc 
ctccagttga 
aattttccaa 
gcggccgctc 



caggagaaca 
agccgcctca 
aagtgcttca 
aatgaagaca 
atcaacacaa 
acaaagctta 
tgtatgaact 
gaacaatttt 
tgattttaaa 
acttgaagac 
gtttctctct 
gcctcctttt 
atgaataaga 
ttctctactt 
tctcaaatta 
aaagtgtttt 
ttgctctctt 
aaaagtcgcc 
ctcttattta 
tacagagatg 
atttctggat 
attttacaga 
ttttatcaaa 
ataataggca 
tcattgggat 
ggttagattc 
tagagt 



-50- 

atgaccaggt 
atatcccctt 
agtgttgctg 
atgagactct 
aagccaacga 
atgatctcaa 
ctaatggaga 
gctatcgact 
tcaccctagt 
ggatataaag 
gtgtctcaat 
tcctttaatc 
acctggtcat 
ttcccttttt 
ggccagattc 
catttctcag 
ggactcacca 
caaaatgcaa 
ttatttttcc 
tagaaagaac 
ggtttttcaa 
atgttatcat 
tatgttttta 
acctctagcg 
gagcatcttt 
caataaatat 



ctggaagttc 
ccccttcatc 
caaggagaaa 
ggcatgggag 
cacctcagag 
gggtcttctg 
aaaatctaat 
actaaatgag 
gtgctgagac 
gaagaatatt 
gcctgggact 
ttatttttga 
gctttactcc 
tgtattatgt 
taaaacatgc 
gatgcttctt 
ggctcctatt 
ccttgaaagg 
attaaaaata 
ataaattgtc 
gtctattttt 
actacatata 
ttatattcat 
attaccataa 
gtgcatgaat 
ctatttatta 



cagaggtact 

gtcttcgctt 

aacatggagt 

ggtgtcatga 

gaaatgaggc 

aaagagattg 

tatagcaaga 

agattttcag 

cttgagaata 

tcctttatgt 

ggaggttgat 

tgaacacata 

tgtattgtta 

gactaattag 

tgcagcaaga 

acctgtcaga 

gaaggaacca 

cactactgac. 

atagctggct 

cccattacct 

tttctatgta 

tactttttat 

agccttctta 

ttttgctcat 

cctattgctg 

titaaaaaaaa 



3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4646 





<210> 


32 




























<211> 


1104 


























<212> 


PRT 




























<213> 


Homo Sapiens 






















<400> 


32 


























Met 


Ser 


Phe 


Arg 


Ala 


Ala 


Arg 


Leu 


Ser 


Met 


Arg 


Asn 


Arg 


Arg 


Asn 


Asp 


1 








5 










10 










15 




Thr 


Leu 


Asp 


Ser 


Thr 


Arg 


Thr 


Leu 


Tyr 


Ser 


Ser 


Ala 


Ser 


Arg 


Ser 


Thr 






20 










25 










30 






Asp 


Leu 


Ser 


Tyr 


Ser 


Glu 


Ser 


Asp 


Leu 


Val 


Asn 


Phe 


He 


Gin 


Ala 


Asn 




35 










40 










45 








Phe 


Lys 


Lys 


Arg 


Glu 


Cys 


Val 


Phe 


Phe 


Thr 


Lys 


Asp 


Ser 


Lys 


Ala 


Thr 




50 










55 










60 










Glu 


Asn 


Val 


Cys 


Lys 


Cys 


Gly 


Tyr 


Ala 


Gin 


Ser 


Gin 


His 


Met 


Glu 


Gly 


65 










70 










75 










80 


Thr 


Gin 


He 


Asn 


Gin 


Ser 


Glu 


Lys 


Trp 


Asn 


Tyr 


Lys 


Lys 


His 


Thr 


Lys 










85 










90 










95 




Glu 


Phe 


Pro 


Thr 


Asp 


Ala 


Phe 


Gly 


Asp 


He 


Gin 


Phe 


Glu 


Thr 


Leu 


Gly 








100 










105 










110 






Lys 


Lys 


Gly 


Lys 


Tyr 


He 


Arg 


Leu 


Ser 


Cys 


Asp 


Thr 


Asp 


Ala 


Glu 


He 






115 










120 










125 








Leu 


Tyr 


Glu 


Leu 


Leu 


Thr 


Gin 


His 


Trp 


His 


Leu 


Lys 


Thr 


Pro 


Asn 


Leu 




130 










135 










140 










Val 


He 


Ser 


Val 


Thr 


Gly 


Gly 


Ala 


Lys 


Asn 


Phe 


Ala 


Leu 


Lys 


Pro 


Arg 


145 










150 










155 










160 


Met 


Arg 


Lys 


He 


Phe 


Ser 


Arg 


Leu 


He 


Tyr 


He 


Ala 


Gin 


Ser 


Lys 


Gly 






165 










170 










175 




Ala 


Trp 


He 


Leu 


Thr 


Gly 


Gly 


Thr 


His 


Tyr 


Gly 


Leu 


Met 


Lys 


Tyr 


He 






180 










185 










190 






Gly Glu 


Val 


Val 


Arg 


Asp 


Asn 


Thr 


He 


■ Ser 


Arg 


Ser 


Ser 


Glu 


Glu 


Asn 






195 










200 










205 
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T 1 o 




Ala 


lie 


Glv 


He 


Ala 


Ala 


Trp 


Gly 


Met 


Val 


Ser 


Asn 


Arg 


Asp 




210 










215 










220 












Leu 


lie 


Arg 


Asn 


Cys 


Asp 


Ala 


Glu 


Gly Tyr 


Phe 


Leu 


Ala 


Gin 


Tyr 


225 








230 










235 










240 


L611 


Met. 


Asp 


Asp 


Phe 


Thr 


Arg 


Asp 


Pro 


Leu 


Cys 


He 


Leu Asp 


Asn 


Asn 








245 










250 










255 




U { 19 

nl.S 




nxs 


Leu 


Leu 


Leu 


Val 


Asp 


Asn 


Gly 


Cys 


His 


oxy 


nxs 


Pro 


Thr 








260 










265 
















vaj. 


IsXU 


AX a 


Lys 


Leu 


Arg 


Asn 


Gin 


Leu 


Glu 


Lys 


Tvr 


T 1 o 

X xe 




Glu 


Ara 






Z 1 o 








280 


















Thr 


xxe 


uxn 


Asp 


Ser 


sn 


i yi 


Gly Gly Lys 


X Xc 


Pro 


He 


vax 


Cys 


Phe 












295 










300 










AX a 




oxy 


Gly Gly 


Lys 


Glu 


Thr 


Leu 


Lys 




He 


Asn 


1 nr 


Ser 


He 


305 








310 










JXj 










320 


Lys 


Asn 


Lys 


He 


Pro 


Cys 


Va J. 


Val 


Val 


Glu 


ux y 


Ser 


Gly 


uxn 


He 


Ala 






325 










330 










335 




Asp 


Val 


ixe 


Ala 


Ser 


Leu 


va± 


Glu 


Val 


Glu 


Asp 


Ala 


Leu 


1 nr 


Ser 


Ser 






340 










34 5 
















Ala 


Val 


Lys 


Glu 


Lys 


Leu 




Arg 


Phe 


Leu 


Pro 


Arg 


Thr 


Vox 


Ser 


Arg 
















360 










365 








Leu 


Pro 


GXU 


Glu 


Glu 


inr 




Ser 


Trp 


He 


Lys 


Trp 


Leu 


Xiys 


Glu 


He 




J /U 










■J i ZJ 










380 










Leu 


Glu 


Cys 


Ser 


His 


Leu 


Leu 


Thr 


Val 


He 


Lys 


Met 


Glu 


uXU 


Ala 


Glv 


385 








390 




















400 


Asp 


Glu 


xxe 


Val 


Ser 


Asn 


AX a 


He 


Ser 


Tyr 


AVXo. 


Leu 


Tyr 


Lys 


Ala 


Phe 








4 05 










410 










415 




Seio 


Tnr 


oer 


Glu 


Gin 


Asp 


Lys 


TV r> v-v 

ASp 


Asn 


Trp 


Asn 


Gly Gin 


Leu 


Lys 


Leu 








420 


























Leu 


Leu 


uXU 


Trp 


Asn 


Gin 


Leu 


Asp 


Leu 


Axa 


Asn 


Asp 


Glu 


X Xe 


Phe 


Thr 
















4 4 U 










445 








Asn 


Asp 


Ajrg 


Arg 


Trp 


Glu 


Ser 


Ala 


Asp 


Leu 


UXXl 


Glu 


Val 


Mat* 


Phe 


Thr 




hOKJ 










455 










4 60 










AX a 


Leu 


xxe 


Lys 


Asp Arg 


Pro 


Lys 


pne 


vax 


Arg 


Leu 


Phe 


Leu 


Glu 


Asn 


4 DO 










470 




















480 


GXy 


lieu 


Asn 


Leu 


Arg 


Lys 




Leu 


Tnr 


HXS 


Asp 


Val 


Leu 


1 nx 


Glu 


Leu 








485 










A on 

4 yu 










495 




rrne 


oe jt 


Asn 


His 


Phe 


Ser 


Thr 


Leu 


Val 


Tvr 


rg 


Asn 


Leu 


UXlI 


He 


Ala 








500 










505 










^ Xu 






Lys 


Asn 


Q o T- 


Tyr 


Asn 


Asp 


Ala 


Leu 


Leu 


Thr 


Phe 


Val 


Trp 


Lys 


Leu 


Val 




J J. ^ 










520 










525 








/ixa 


Asn 


irne 


Arg 


Arg 


Gly 


Phe 


Arg 


Lys 


Glu 


Asp 


Arg 


Asn 


CZl XT 


Arg 


Asp 














535 










540 










p 1 11 




Asp 


He 


Glu 


Leu 


His 


Asp 


Val 


Ser 


Pro 


He 


Thr 


Arg 


His 


Pro 


DH D 








550 










555 










560 


Leu 






Leu 


Phe 


He 


T T~o 
1 rp 


Ala 


He 


Leu 


Gin 


Asn 


Lys 


Lys 


Glu 


Leu 










565 










570 










575 




Seir 


Lys 


Val 


He 


Trp 


Glu 


Gin 


Thr 


Arg 


Gly 


Cys 


Thr 


Leu 


Ala 


Ala 


Leu 






580 










585 










590 






Gly 


Ala 


Ser 


Lys 


Leu 


Leu 


Lys 


Thr 


Leu 


Ala 


Lys 


Val 


Lys 


Asn 


Asp 


He 




595 










600 










605 








Asn 


Ala 


Ala 


Gly 


Glu 


Ser 


Glu 


Glu 


Leu 


Ala 


Asn 


Glu 


Tyr 


Glu 


Thr 


Arg 




610 








615 










620 










Ala 


Val 


Glu 


Leu 


Phe 


Thr 


Glu 


Cys 


Tyr 


Ser 


Ser 


Asp 


Glu 


Asp 


Leu 


Ala 


625 










630 










635 










640 


Glu 


Gin 


Leu 


Leu 


Val 


Tyr 


Ser 


Cys 


Glu 


Ala 


Trp 


Gly 


Gly 


Ser 


Asn 


Cys 










645 










650 










655 




Leu 


Glu 


Leu 


Ala 


Val 


Glu 


Ala 


Thr 


Asp 


Gin 


His 


Phe 


He 


Ala 


Gin 


Pro 








660 










665 










670 






Gly 


Val 


Gin 


Asn 


Phe 


Leu 


Ser 


Lys 


Gin 


Trp 


Tyr 


Gly 


Glu 


He 


Ser 


Arg 



675 680 685 
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Asp 


Thr 


Lys 


Asn 


Trp 


Lys 


He 


He 


Leu 


Cys 


Leu 


Phe 


He 


He 


Pro 


Leu 




690 










695 










700 










Val 


Gly 


Cys 


Gly 


Phe 


Vai 


Ser 


Phe 


Arg 


Lys 


Lys 


Pro 


Val 


Asp 


Lys 


His 


705 










710 










715 










720 


Lys 


Lys 


Leu 


Leu 


Trp 


Tyr 


Tyr 


Val 


Ala 


Phe 


Phe 


Thr 


Ser 


Pro 


Phe 


Val 










725 










730 










735 




Val 


Phe 


Ser 


Trp 


Asn 


Val 


Val 


Phe 


Tyr 


He 


Ala 


Phe 


Leu 


Leu 


Leu 


Phe 








740 










745 










750 






Ala 


Tyr 


Val 


Leu 


Leu 


Met 


Asp 


Phe 


His 


Ser 


Val 


Pro 


His 


Pro 


Pro 


Glu 




755 










760 










765 








Leu 


Val 


Leu 


Tyr 


Ser 


Leu 


Val 


Phe 


Val 


Leu 


Phe 


Cys 


Asp 


Glu 


Val 


Arg 




770 










775 










780 










Gin 


Trp 


Tyr 


Val 


Asn 


Gly Val 


Asn 


Tyr 


Phe 


Thr 


Asp 


Leu 


Trp Asn 


Vai 


785 










790 










795 










800 


Met 


Asp 


Thr 


Leu 


Gly 


Leu 


Phe 


Tyr 


Phe 


He 


Ala 


Gly 


He 


Val 


Phe 


Arg 










805 










810 










815 




Leu 


His 


Ser 


Ser 


Asn 


Lys 


Ser 


Ser 


Leu 


Tyr 


Ser 


Gly 


Arg 


Val 


He 


Phe 








820 










825 










830 






Cys 


Leu 


Asp 


Tyr 


He 


He 


Phe 


Thr 


Leu Arg 


Leu 


He 


His 


He 


Phe 


Thr 






835 










840 










845 








Val 


Ser 


Arg 


Asn 


Leu 


Gly 


Pro 


Lys 


He 


He 


Met 


Leu 


Gin 


Arg 


Met 


Leu 




850 










855 










860 










He 


Asp 


Val 


Phe 


Phe 


Phe 


Leu 


Phe 


Leu 


Phe 


Ala 


Val 


Trp 


Met 


Val 


Ala 


865 








870 










875 










880 


Phe 


Gly 


Val 


Ala 


Arg 


Gin 


Gly 


He 


Leu 


Arg 


Gin 


Asn 


Glu 


Gin 


Arg 


Trp 










885 










890 










895 




Arg 


Trp 


He 


Phe 


Arg 


Ser 


Val 


He 


Tyr 


Glu 


Pro 


Tyr 


Leu 


Ala 


Met 


Phe 








900 










905 










910 






Gly 


Gin 


Val 


Pro 


Ser 


Asp 


Vai 


Asp 


Gly 


Thr 


Thr 


Tyr 


Asp 


Phe 


Ala 


His 






915 










920 










925 








Cys 


Thr 


Phe 


Thr 


Gly 


Asn 


Glu 


Ser 


Lys 


Pro 


Leu 


Cys 


Val 


Glu 


Leu 


Asp 




930 










935 










940 










Glu 


His 


Asn 


Leu 


Pro 


Arg 


Phe 


Pro 


Glu 


Trp 


He 


Thr 


He 


Pro 


Leu 


Val 


945 










950 










955 










960 


Cys 


He 


Tyr 


Met 


Leu 


Ser 


Thr 


Asn 


He 


Leu 


Leu 


Vai 


Asn 


Leu 


Leu 


Val 










965 










970 










97 5 




Ala 


Met 


Phe 


Gly 


Tyr 


Thr 


Vai 


Gly 


Thr 


Val 


Gin 


Glu 


Asn 


Asn 


Asp 


Gin 








980 










985 










990 






Val 


Trp 


Lys 


Phe 


Gin 


Arg 


Tyr 


Phe 


Leu 


Val 


Gin 


Glu 


Tyr 


Cys 


Ser 


Arg 






995 










1000 








1005 






Leu 


Asn 


He 


Pro 


Phe 


Pro 


Phe 


He 


Val 


Phe 


Ala 


Tyr 


Phe 


Tyr 


Met 


Vai 




1010 








1015 








1020 








Val 


Lys 


Lys 


Cys 


Phe 


Lys 


Cys 


Cys 


Cys 


Lys 


Glu 


Lys 


Asn 


Met 


Glu 


Ser 


1025 








1030 








1035 








104 


Ser 


Val 


Cys 


Cys 


Phe 


Lys 


Asn 


Glu 


Asp 


Asn 


Glu 


Thr 


Leu 


Ala 


Trp 


Glu 










1045 








1050 








1055 


Gly 


Val 


Met 


Lys 


Glu 


Asn 


Tyr 


Leu 


Val 


Lys 


He 


Asn 


Thr 


Lys 


Ala 


Asn 








1060 








1065 








1070 




Asp 


Thr 


Ser 


Glu 


Glu 


Met 


Arg 


His 


Arg 


Phe 


Arg 


Gin 


Leu 


Asp 


Thr 


Lys 



1075 1080 1085 



Leu Asn Asp Leu Lys Gly Leu Leu Lys Glu He Ala Asn Lys He Lys 
1090 1095 1100 
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Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 
This rmemalional Search Report has not been established in respect of certain ciaims under Article 17(2Xa) for the foDowing reasons: 
1. rn aaimsNos.: 

— because they relate to subject matter not required to be searched by this Authority, name^ 



pn ClafmsNos.: 

— because they relate to parts of the International AppDcation that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out, specifically: 

see FURTHER INFORMATION sheet PCT/ISA/210 



a n aa^sNos.: 

— because they are dependent claims and are not drafted in accordance with the second and thini sentences of Rule 6.4(a). 

Box 11 Observations where unity of Invention Is lactcing (Continuation of Item 2 of first sheet) 

This Inten^tiona] Searching Authority found multiple Inventions in this International application, as follows: 

see additional sheet 

As a result of the prior review under R. 40.2(e) PCT, 
no additional fees are to be refunded. 

1 . I I As all required additional search fees were timely paid by the applicant, this International Search Report covers all 
* — ' searchable claims. 

2. As all searchable claims oould be sean:hed without effort justifying an additional fee. this Authority did not invite payment 
of any additional fee. 



3. nn As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
oovere only tfiose claims for which fees were paid, specifically claims Nos.: 

1-36 



4. I I No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
restricted to the invention first mentioned in the daims; it is covered by claims Nos.: 



Remark on Protest [ X | The additional search fees were accompanied by the applicant's protest. 

I I No protest accompanied the payment of additional search fees. 
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Present claims 1-5, 10-13, 16-19, 32-35 relate to an extremely large 
number of possible polynucleotides, polypeptides encoded by them, binding 
polypeptides, and kits and pharmaceutical compositions containing said 
polypeptides and polynucleotides. Support within the meaning of Article 6 
PCT and/or disclosure within the meaning of Article 5 PCI is to be found, 
however, for only a very small proportion of the compounds claimed. In 
the present case, the claims so lack support, and the application so 
lacks disclosure, that a meaningful search over the whole of the claimed 
scope is impossible. Consequently, the search has been carried out for 
those parts of the claims which appear to be supported and disclosed, 
namely those parts relating to the polynucleotide of SEQ ID NOs: 1, 27, 
29 and 31 and the corresponding polypeptide of SEQ ID NOs: 2, 28, 30 and 
32. 

Present claims 16 and 17 relate to an extremely large number of possible 
compounds, namely, a polypeptide that binds to the polypeptide of the 
invention. Support within the meaning of Article 6 PCT and/or disclosure 
within the meaning of Article 5 PCT is to be found, however, for only a 
very small proportion of the compounds claimed. In the present case, the 
claims so lack support, and the application so lacks disclosure, that a 
meaningful search over the whole of the claimed scope is impossible. 
Consequently, the search has been carried out for those parts of the 
claims which appear to be supported and disclosed, namely those parts 
relating to an antibody, antibody fragment, F(ab)2 fragment or a fragment 
including a CDR3 region selective for the polypeptides of the invention. 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to Inventions in respect of which no international 
search report has been established need not be the subject of an 
International preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1, 6-36 (partially) and 2 (complete) 

An isolated nucleic acid molecule comprising a nucleic acid 
molecule that hybridizes to a nucleic acid molecule of SEQ 
ID N0:1 and which code for a SOC/CRAC polypeptide, nucleic 
acid molecules that differ in codon sequence due to 
degeneracy of the genetic code and complement thereof, 
polynucleotides which are not identical to the SEQ ID or 
sequences of GenBank accession number of Table 1; expression 
vector, host cells; polypeptide encoded thereof (SEQ ID 
N0:2); polypeptides binding to the polypeptide of SEQ ID 
N0:2, including antibodies; kits comprising agents that 
selectively bind to the polynucleotide (SEQ ID N0:1) or 
polypeptide (SEQ ID N0:2) of the Invention; pharmaceutical 
compositions containing the polynucleotide or polypeptides 
of the invention; a method for isolating the SOC/CRAC 
molecule having SOC/CRAC calcium channel activity comprising 
contacting a binding molecule that is SOC/CRAC nucleic acid 
or a SOC/CRAC binding polypeptide with a sample containing 
SOC/CRAC molecules allowing the formation of the complex, 
detecting the formation of the complex, isolating the 
SOC/CRAC molecule and determining whether the isolated 
SOC/CRAC molecule has SOC/CRAC calcium channel activity; a 
method for identifying agents useful in the modulation of 
SOC/CRAC calcium channel activity; a method to determine the 
level of SOC/CRAC expression in a subject. Including 
expression of SOC/CRAC polypeptide or mRNA in a tissue or 
biological fluid sample using PCR, Northern blotting, and 
mono- and polyclonal ant 1 sera and a method for identifying 
agents useful in the modulation of the SOC/CRAC polypeptide 
kinase activity, comprising the use of aminoacids 999-1180 
from SEQ ID NO: 4 as a candidate kinase. 



2. Claims: 1,6-36 (partially) 

As subject 1, but referred to the polynucleotide of SEQ ID 
N0:3 and to the encoded polypeptide of SEQ ID N0:4 



3, Claims: 1,6-36 (partially) 

As subject 1, but referred to the polynucleotide of SEQ ID 
NO: 5 and to the encoded polypeptide of SEQ ID N0:6 



4. Claims: 1,6-36 (partially) 

As subject 1, but referred to the polynucleotide of SEQ ID 
NO: 7 and to the encoded polypeptide of SEQ ID N0:8 
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5. Claims: 1,6-36 (partially) and 37 (complete) 

As subject 1, but referred to the polynucleotide of SEQ ID 
NO: 23 and to the encoded polypeptide of SEQ ID NO: 24 

6. Claims: 1,6-36 (partially) 

As subject 1, but referred to the polynucleotide of SEQ ID 
N0:25 and to the encoded polypeptide of SEQ ID N0:26 

7. Claims: 1,10-36 (partially) and 3 (complete) 

As subject 1, but referred to the polynucleotide of SEQ ID 
NO: 27 and to the encoded polypeptide of SEQ ID NO: 28 

8. Claims: 1,6-36 (partially) and 4 (complete) 

As subject 1, but referred to the polynucleotide of SEQ ID 
N0:29 and to the encoded polypeptide of SEQ ID N0:30 

9. Claims: 1,6-36 (partially) and 5 (complete) 

As subject 1, but referred to the polynucleotide of SEQ ID 
N0:31 and to the encoded polypeptide of SEQ ID NO: 32. 
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